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OM nucleic - nucleic search, using sw model 

Run on: July 12, 2004, 14:44:07 ; Search time 6771 Seconds 

(without alignments) 
12137.162 Million cell updates/sec 

Title: US-10-624-'932-l 
Perfect score: 2752 

Sequence: 1 ccgcggggccccgcgcccgg tgagtgctgaggccggccag 2752 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 27513289 seqs, 14931090276 residues 

Total number of hits satisfying chosen parameters: 55026578 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : EST:* 

1: em_estba:* 

2: em_esthum:* 

3: em_estin:* 

4: em_estmu:* 

5: em_estov:* 

6: em_estpl:* 

7: em_estro:* 

8: em_htc:* 

9: gb_estl:* 
10: gb__est2:* 
11: gb_htc:* 
12: gb_est3:* 
13: gb_est4:* 
14: gb_est5:* 
15: em_estfun:* 
16: em__estom:* 
17: em_gss_hum:* 
IB: em_gss_inv:* 

19: em_gss_pln:* 
20: em_gss_vrt:* 

21: em_gss_fun:* 

2 2 : em_g s s_mam : * 

2 3 : em_gs s_mus : * 

24: em__gss_pro: * 

25: em_gss_rod:* 

26: em_gss_phg:* 

27: em_gss_vrl:* 



28: gb___gssl:* 
29: gb_gss2:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
AY406491 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



gene 



ORIGIN 



AY406491 2802 bp DNA linear GSS 15-DEC-2003 

Homo sapiens UNC5C gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence, 
AY406491 

AY406491.1 GI: 39762465 
GSS. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 

1 (bases 1 to 2802) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas,?., Ke jariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello,D.R., Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human- chimp -mouse orthologous 

gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 2802) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas,?., Ke j ariwal , A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Direct Submission 

Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and ordering 
them based on alignment. 

Location/Qualif iers 

1. .2802 

/organism="Homo sapiens" 

/mol_type="genomic DNA" 
/db_xref="taxon: 9606" 
<1. .>2802 
/gene="UNC5C" 
/locus_tag="HCM2575" 



Query Match 34.9%; Score 960.2; DB 29; Length 2802; 

Best Local Similarity 61.3%; Pred. No. 1.2e-153; 

Matches 1682; Conservative 0; Mismatches 995; Indels 69; Gaps 6; 

Qy 57 GCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGG 116 

I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 66 GCTCGTGCTACCTGCCCTGGCCCTGCTCAGCGCCAGCGGCACTGGCTCCGCCGCCCAAGA 125 

Qy 117 TGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCC 176 

M I I I I I I I I I I I I I I I I I I I I I 

Db 126 TGATGACTTTTTTCATGAACTCCCAGAAACTTTTCCTTCTGATCCACCTGAGCCTCTGCC 185 



Qy 177 CCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGT 236 

I I I I I I I I M I I I M I I I I I I I I I I I I I I I M M I I I II 
Db 186 ACATTTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAGAATAAGCCTGTGAACCTGTA 245 

Qy 237 GTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCT^GTGCAACGGGGAGTGGGTGCGCCA 2 96 

I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I II 

Db 246 CTGTAAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCATCA 305 

Qy 297 GGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGT 356 

I II II I I I I III II I I I I I I I II I I I I I I 

Db 306 GAAGGACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTCTCATTGTCCGGGAAGT 365 

Qy 357 CCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTG 416 

II i II I III I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 366 GAGCATTGAGATTTCGCGCCAGCAAGTGGAAGAACTCTTTGGACCTGAAGATTACTGGTG 425 

Qy 417 CCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCAT 476 

I I I M I I I I I I I I I I I I I II I I I I Mill I I I I I I I I I I II II I I I I I I 
Db 42 6 CCAGTGTGTGGCCTGGAGCTCCGCGGGTACCACAAAGAGCCGGAAGGCGTATGTGCGCAT 4 85 

Qy 477 AGC CAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCT 530 

II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 486 TGCATNNNNNNNNNNNCGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTT 545 

Qy 531 GGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGA 590 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I II I 

Db 546 GGAACAGGAAGTCTTACTCCAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAGGTGGA 605 

Qy 591 GTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCG 650 

II I I I I I I I I I I I I I I I III I I I I I I I I I I 

Db 606 ATGGTTGAAAAATGAAGACATAATTGATCCCGTTGAAGATCGGAATTTTTATATTACTAT 665 

Qy 651 GGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGT 710 

I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I 

Db 666 TGATCACAACCTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAATTACACCTGTGT 725 

Qy 711 GGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAA 77 0 

II I I I I I I I I I I I I I II I III I I M I I I I I I M I I I I 

Db 726 TGCCAAAAACATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATGTCAA 785 

Qy 771 CGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTG 830 

I I I I I I Mill II II I II I II I M I I II II I I I I II II M II M I 
Db 786 CGGTGGCTGGTCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTA 845 

Qy 831 GCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGA 890 

II I I I I I I I III II I II I II II I I I II II II I I II II I I I II I I I I I 
Db 846 TCAGAAACGTACAAGGACTTGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTGA 905 

Qy 891 GGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAG 950 

I II II M I I I I II I I I I I I I I I I III I I I II I I I I I I I II I I I I I I 
Db 906 AGGGCAGAGTGTGCAGAAAATAGCCTGTACTACGTTATGCCCAGTGGATGGCAGGTGGAC 965 

Qy 951 CCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTG 1010 

I I I II I II II I I I I M I M I I I II I I I II M I M I I II I I I I I I II 

Db 966 GCCATGGAGCAAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGCAGGAGGGAGTG 1025 



Qy 


1011 


CTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCG 

III 1 II 1 1 1 II II 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
CACGGCGCCAGCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTGCAATCCAA 


1070 


Db 


1026 


1085 


Qy 


1071 


CTVACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTA 

1 1 M 1 1 1 II MM 1 II 1 M M 1 II 1 M II M M II III 
GAACTGCACTGATGGGCTTTGCATGCAGACTGCTCCTGATTCAGATGATGTTGCTCTCTA 


1130 


Db 


1086 


1145 


Qy 


1131 


TGTGGG CCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCT 

1 II II 1 1 1 II 1 M II 1 1 1 II II 1 1 1 MM II 1 

TGTTGGGATTGTGATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTGGCCTTGTT 


1187 


Db 


1146 


1205 


Qy 


1188 


CGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCAC 


1247 


Db 


1206 


Mini Ml 1 1 1 M II M II 1 II II II II MM 

TGTGTATCGGAAGAATCATCGTGACTTTGAGTCAGATATTATTGACTCTTCGGCACTCAA 


1265 


Qy 


1248 


CTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCAC 

1 M 1 1 1 M 1 1 1 1 1 1 II II II 1 II 1 1 1 II 1 M 1 1 1 

TGGGGGCTTTCAGCCTGTGAACATCAAG GCAGCAAGACAAGATCTGCTGGC 


1307 


Db 


1266 


1316 


Qy 


1308 


CATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCA 
II II M M II 1 1 1 1 II 1 III II 1 1 M 1 II 1 1 II 
TGTACCCCCAGACCTCACGTCAGCTGCAGCCATGTACAGAGGACCTGTCTATGCCCTGCA 


1367 


Db 


1317 


1376 


Qy 


1368 


GGATG GGCCCAGCCCC^iAGTTCCAGCTCACCAAT GGGCACCTGCTCAGCCC 

Ml II 1 II II 1 1 1 1 M 1 M 1 II 1 1 1 1 II II 1 1 
TGACGTCTCAGACAAAATCCCAATGACCAACTCTCCAATTCTGGATCCACTGCCCAACCT 


1418 


Db 


1377 


1436 


Qy 


1419 


CCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTT 
1 1 1 II 1 1 1 II 1 III 1 II II 1 
GAAAATCAAAGTGTACAACACCTCAGGTGCTGTCACCCCCCAAGATGACCTCTCTGAGTT 


1478 


Db 


1437 


1496 


Qy 


1479 


CGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACAT 
Ml 1 1 1 II 1 II II 1 1 II 1 1 1 1 1 1 1 
TACGTCCAAGCTGTCCCCTCAGATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCAGCCT 


1538 


Db 


1497 


1556 


Qy 


1539 


M 1 1 1 M 1 1 II 1 M 
GAAGAACCAGAGTCTAGCAAGGCAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAA 


1556 


Db 


1557 


1616 


Qy 


1557 


CTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGA 
M II 1 1 M 1 II II 1 II 1 II II II 1 1 1 1 1 1 II 1 1 II 1 1 1 
CTCGCTGGGAGGTCACCTTATTGTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGG 


1616 


Db 


1617 


1676 


Qy 


1617 


TGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGT 

II 1 1 1 II 1 1 M II 1 M II M M 1 1 III 1 II II 1 Ml 1 

GGCCATTCCCCAAGGGAGAGTCTACGAAATGTATGTGACTGTACACAGGAAAGAAACTAT 


1676 


Db 


1677 


1736 


Qy 

Db 


1677 
1737 


GAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCC 
1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M II III M 1 II M II II II II III 
GAGGCCACCCATGGATGACTCTCAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCCC 


1736 
1796 


Qy 


1737 


TGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCC 

Ml M II M II M 1 II II 1 1 1 M 1 M II 1 1 1 1 II 1 1 II M II 1 

AGGAGCTCTGCTCACCCGCCCAGTCGTCCTCACTATGCATCACTGCGCAGACCCCAATAC 


1796 


Db 


1797 


1856 


Qy 


1797 


TGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCT 


1856 



Db 


1857 


CGAGGACTGGAAAATACTGCTCAAGAACCAGGCAGCACAGGGACAGTGGGAGGATGTGGT 1916 


Qy 


1857 


GCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTG 1916 

1 1 1 1 1 1 1 1 1 III II 1 1 1 1 II 1 1 M II 1 1 1 1 Mill 

GGTGGTCGGGGAGGAAAACTTCACCACCCCCTGCTACATTCAGCTGGATGCAGAGGCCTG 1976 


Db 


1917 


Qy 


1917 


CTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGC 1976 

1 II II llll IJI 1 II II 11 ijiiiiti III i 

1 II II 1 1 1 1 1 II 1 II M II II 1 II 1 1 1 1 1 1 1 II III II 

CCACATCCTCACAGAGAACCTCAGCACCTACGCCCTGGTAGGACATTCCACCACCAAAGC 2036 


Db 


1977 


Qy 


1977 


TGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTA 2036 

II 11 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 I III k t .... 

II M M 1 M II II M 1 1 1 1 1 llll III II 1 1 1 1 llll II 1 1 1 1 1 

GGCTGCGAAGCGCCTCAAGCTGGCCATCTTTGGGCCCCTGTGCTGCTCCTCGCTGGAGTA 2096 


Db 


2037 


Qy 


2037 


CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCT 2096 

II llllli llllllll lii llll J !■■■> 

II II II 1 1 llllllll III 1 1 1 1 1 1 1 II 1 Mill II II II 1 1 1 II II 

CAGCATCCGAGTCTACTGTCTGGATGACACCCAGGATGCCCTGAAGGAAATTTTACATCT 2156 


Db 


2097 


Qy 


2097 


GGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAG 2156 

MM 1 II 1 II 1 1 M 1 1 1 1 1 1 Mill II 1 1 II 1 1 1 1 1 1 II 

TGAGAGACAGATGGGAGGACAGCTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGGCAG 2216 


Db 


2157 


Qy 


2157 


TTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCT 2216 

M M II II II 1 II 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 II 1 

CACCCACAACCTGCGCCTGTCAATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATT 2276 


Db 


2217 


Qy 


2217 


CCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTT 227 6 

III 1 M M 1 M M II II 1 1 1 1 1 llllllll III 1 III 

GCTGGCTAAATATCAGGAAATTCCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCT 2336 


Db 


2277 


Qy 


2277 


GCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCT 2336 

IllJIIillllllll >ILII I ■ iktk .... 

1 M 1 M II 1 II M II II II 1 1 Mill llll 1 II II 1 1 1 1 1 II 1 

GCACTGCACCTTCACTCTGGAAAGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACT 2396 


Db 


2337 


Qy 


2337 


GTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAA 2396 

M 1 1 II 1 1 II 1 1 1 1 II II 1 1 1 II 1 1 II 1 1 1 1 II 1 1 M 1 II 1 1 

CTGTGTGCGGCAGGTGGAAGGAGAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGA 2456 


Db 


2397 


Qy 


2397 


GGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGT 2456 


Db 


2457 


Mill 1 1 1 1 II 1 1 II 1 II 1 II 1 1 
GGAACCTACTGGCATCGATTTGCCGCTGCTGGATCCTGCGAACACCATCACCACGGTCAC 2516 


Qy 


2457 


GGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGA 2516 

M II 1 M 1 1 1 M 1 II II 1 1 1 III 1 II 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

GGGGCCCAGTGCTTTCAGCATCCCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGA 257 6 


Db 


2517 


Qy 


2517 


CCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAG 2576 

1 M 1 1 II 1 1 1 1 II 1 II II II 1 1 1 II II II 1 1 1 II 1 1 1 1 1 1 
TGCCCCCCAGACGAGAGGCCATGACTGGAGGATGCTGGCCCATAAGCTGAACCTGGACAG 2636 


Db 


2577 


Qy 


2577 


CCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGA 2636 

1 II II M 1 1 1 1 1 1 M 1 M 1 M II 1 II 1 M II 1 II II 1 M 1 

GTACTTGAATTACTTTGCCACCAAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGA 2696 


Db 


2637 


Qy 


2637 


GGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGG 2696 



1 1 I I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I III I I 1 1 11 



Db 2697 AGCACAGAACTTCCCAGATGGAAACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGG 2756 

Qy 2697 CCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I I I I I I I I I I I I I 
Db 2757 AAGACATGAAACGGTGGTGTCCTTAGCAGCAGAAGGGCAGTATTAA 2802 



RESULT 2 
AY406493 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



gene 



ORIGIN 



AY406493 2791 bp DNA linear GSS 15-DEC-2003 

Mus musculus UNC5C gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. 
AY406493 

AY4 064 93. 1 GI: 397624 67 
GSS. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

1 (bases 1 to 2791) 

Clark,A.G., Glanowski, S . , Nielsen, R., Thomas, P., Ke j ariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human- chimp -mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 2791) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas,?., Kej ariwal , A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D.R., Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Direct Submission 

Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and ordering 
them based on alignment. 

Location/Qualifiers 

1. .2791 

/organism="Mus musculus" 
/mol_type="genomic DNA" 
/db_xref="taxon: 10090" 
<1. .>2791 
/gene="UNC5C" 
/locus_tag="HCM2575" 



Query Match 34.5%; Score 950.4; DB 29; Length 2791; 

Best Local Similarity 62.3%; Pred. No. 5.6e-152; 

Matches 1653; Conservative 0; Mismatches 921; Indels 80; Gaps 7; 

Qy 149 TGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACA 208 

III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Dt> 158 TTCCATCTGACCCACCTGAGCCATTGCCACACTTCCTCATTGAGCCCGAGGAAGCTTACA 217 



Qy 209 TCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCT 268 



I I I I I I I I I I I I I I I I I II M I I II I I I I I I I I I I I I I I I I I 

Db 218 TTGTGAAGAACAAGCCTGTGAACCTGTATTGTA7\AGCCAGCCCTGCCACCCAGATCTACT 277 

Qy 269 TCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACG 328 

M I I I M II I I I I I I I I I I I I III I I I I M I I I I I I II I I I I 

Db 278 TCAAGTGCAACAGCGAGTGGGTTCATCAGAAGGACCACGTAGTAGACGAGAGAGTAGATG 337 

Qy 329 GGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGA 38 8 

II II I II I MM Mini 1 M I I M II II M II I 

Db 338 AAACCTCTGGTCTAATTGTGAGAGAAGTGAGCATTGAGATTTCACGCCAGCAGGTGGAGG 397 

Qy 389 AGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCA 448 

I II I I M M II II M II I I II I M I II M II I M II M II I I I I M I 

Db 398 AACTGTTTGGGCCTGAAGATTACTGGTGCCAGTGTGTGGCCTGGAGCTCAGCAGGCACTA 457 

Qy 449 CCAAGAGTCAGAAGGCCTACATCCGCATAGCCAG ATTGCGCAAGAACTTCGAGC 502 

I M II II I II II M III I M I II II I I II I I I I II I II II 

Db 458 CGAAGAGTCGGAAGGCATACGTGCGCATTGCGTTTCCAGATCTGCGGAAGACATTCGAGC 517 

Qy 503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 562 

I I II M III I I I M I I I I M II M I II M I I I II I I I I I II M 

Db 518 AGGAACCCTTGGGAAAGGAAGTGTCCTTGGAGCAGGT^GTCTTACTCCAGTGTCGGCCAC 577 

Qy 563 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 622 

I I I II II II II I I I I I II II I I I II I I I II I I 

Db 57 8 CTGAAGGGATCC GTAGAATGGCTAAAGAATGAAGACATAATTGATCCTG 62 6 

Qy 623 CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 682 

I II I II I M II I I I I I M I II II I I II I I M I I 

Db 627 CTGAAGATCGGAACTTTTATATTACTATCGATCACAACCTGATCATCAAGCAAGCCCGAC 686 

Qy 683 TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 742 

I I II II II II II I M II II II I M I I M II II I III 

Db 687 TCTCAGATACAGCAAATTATACCTGTGTTGCCAAAAATATTGTTGCCAAGAGAAAAAGCA 746 

Qy 743 CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

II I II I I I I M I II I M M II II I II I I I I I M Mill I II I I II I I 

Db 747 CCACAGCCACTGTCATCGTGTATGTTAATGGTGGCTGGTCCACCTGGACAGAGTGGTCTG 8 06 

Qy 803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 862 

I II I I I M I II II II I II I M I II II I I I I II II I I I I I I I I I 
Db 807 TGTGTAACAGCCGCTGTGGGCGAGGATATCAGAAACGCACAAGAACCTGCACCAACCCAG 866 

Qy 863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGT^TGTCCAGAAAACAGCCTGCGCCA 922 

I II Mill II II M I M M II II M I M II III I I II II I III III I I 

Db 8 67 CCCCACTCAATGGTGGGGCCTTCTGTGAGGGGCAGAGTGTGCAGAATU^TAGCATGCACTA 926 

Qy 923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCTyiLGTGGTCGGCCTGTGGGCTGG 982 

I I I I M II I II I I II II II I M II I I II I I M I I II II I II I 
Db 927 CGTTATGTCCAGTGGATGGTAGGTGGACTTCATGGAGCAAATGGTCAACCTGTGGGACTG 986 

Qy 983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

I I M I II II II II II II I II I I I I I I M M I M II II I I I III 

Db 987 AATGCACCCACTGGCGCAGGAGGGAGTGTACAGCACCAGCCCCCAAGAACGGGGGTAAGG 104 6 

Qy 1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

III I II I MM I II II I II M II II II III II 



Db 1047 ACTGTGATGGCCTGGTCCTCCAATCCAAGAACTGCACTGATGGGCTGTGCATGCAGGCTG 1106 

Qy 1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGG CCTCATCGCCGTGGCCGTCTGCC 1159 

M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 1107 CTCCTGACTCAGATGATGTGGCTCTCTACGTGGGGATTGTGATCGCTGTAACAGTCTGTC 1166 

Qy 1160 TGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACT 1219 

III I I I M I I I II III I Mil I MM 

Db 1167 TGGCGATCACTGTTGTGGTGGCCCTGTTTGTGTATCGGAAGAACCACCGTGACTTTGAGT 1226 

Qy 122 0 CAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

Ml I I I II I 11 I I I II I I I I 11 M I II II I I M I I I 
Db 1227 CTGACATCATTGACTCCTCAGCACTCAATGGCGGCTTTCAGCCTGTGAACATCAAG 1282 

Qy 1280 GCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCA 1339 

I I I II I M I M I I III I II I I I I M II I I III 
Db 1283 GCTGCCAGACAAGATCTCCTGGCTGTCCCCCCTGACCTCACCTCAGCTGCAGCCA 1337 

Qy 1340 CCTACCAGGGCAGTCTCTGTCCCCGGCAGGATG GGCCCAGCCCCAAGTTCCAGCT 1394 

Ml III II I II II 1 1 II 1 1 1 II I 1 1 1 1 1 mil 

Db 1338 TGTACAGGGGACCTGTCTATGCTCTGCATGATGTCTCAGACAAAATCCC/JVTGACCAACT 1397 

Qy 1395 CACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTC 1454 

I II I I I I MINI I I II II III 

Db 1398 CTCCAATTCTGGACCCACTACCCAACTTGAAAATCAAAGTGTACAACAGCTCAGGTGCTG 1457 

Qy 1455 TCCCACCTC TGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCC 1498 

I I I I I I I III I I I II M III I II I III 

Db 1458 TCACTCCTCAGGATGACCTTGCCGAGTTCTCATCCAAACTGTCACCCCAGATGACCCAGT 1517 

Qy 1499 AGAACTACTTCCGCTCCCTGCCCCGAGGCA 1528 

I I I I II I II I I II 

Db 1518 CCTTGCTAGAGAATGAGGCCCTTAACCTGAAGAACCAGAGCCTCGCAAGACAGACTGACC 1577 

Qy 1529 CCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATA 1588 

I I II II I I II I I I I I I I I I I II I I I M I II II I I I I I I I 
Db 1578 CATCCTGCACAGCATTTGGTACCTTCAACTCTCTTGGGGGTCACCTCATCATTCCTAATT 1637 

Qy 1589 CAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCT 1648 

MM I M II I II II I II I M II II I I I II I II I I M II II 

Db 1638 CAGGAGTAAGCTTGCTGATTCCCGCTGGGGCCATTCCTCAGGGGAGAGTCTATGAAATGT 1697 

Qy 1649 ACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGC 1708 

I III I I II I I. I I II Mill II M M II II M I 

Db 1698 ATGTGACTGTACACAGGAT^GAAAATATGAGGCCCCCCATGGAAGACTCTCAGACCCTAC 1757 

Qy 1709 TGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGG 1768 

II M M M I II I M II II I II II III I II II I II II M II I M 

Db 1758 TTACCCCTGTGGTGAGCTGTGGGCCTCCTGGAGCTCTGCTGACCCGCCCTGTCATCCTCA 1817 

Qy 17 69 CTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGT 1828 

I I I I I I II I I M II II M II II I I II II M II II II II II I 
Db 1818 CTCTGCATCACTGTGCAGACCCCAGCACCGAGGACTGGAAGATCCAGCTCAAAAACCAGG 1877 

Qy 1829 CGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCT 1888 

I MM II M II I II II II II M II II I II I II I 

Db 1878 CAGTGCAGGGACAATGGGAGGATGTTGTGGTGGTTGGGGAGGAGAACTTCACAACCCCCT 1937 



Qy 


1889 


ACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTG 

1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 

GTTACATTCAGCTGGATGCAGAGGCTTGCCATATCCTCACAGAGAACCTCAGTACCTATG 


1948 


Db 


1938 


1997 


Qy 


1949 


CCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTG 

1 1 1 M 1 1 1 1 1 1 1 1 III II II 1 1 M 1 1 II 1 1 II II 1 1 1 1 1 1 

CCCTGGTTGGGCAGTCCACCACCAAAGCAGCTGCCAAGCGTCTTAAACTGGCCATCTTTG 


2008 


Db 


1998 


2057 


Qy 


2009 


CGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCC 

III 1 Ml! 1 Mill II 1 1 1 II III 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 II 1 

GGCCCCTCTGCTGCTCTTCCCTGGAGTACAGCATTAGAGTCTACTGCCTGGATGACACAC 


2068 


Db 


2058 


2117 


Qy 


2069 


ACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGG 
1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 II 
AGGATGCCCTGAAGGAAGTTCTACAACTGGAGAGGCAAATGGGAGGACAGCTCCTAGAAG 


2128 


Db 


2118 


2177 


Qy 


2129 


AGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATG 
1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 
AACCCAAGGCTCTTCATTTTAAAGGCAGCATCCACAACCTGCGCCTGTCTATTCATGACA 


2188 


Db 


2178 


2237 


Qy 


2189 


TGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATC 
1 M 1 II 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 II M II M 1 1 1 II 1 
TCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGCTAAGTATCAGGAAATTCCATTTTACC 


2248 


Db 


2238 


2297 


Qy 


2249 


ACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCC 

1 1 M II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 

ACATCTGGAGTGGCTCTCAAAGAAACCTCCACTGCACCTTCACTCTGGAAAGACTCAGCC 


2308 


Db 


2298 


2357 


Qy 


2309 


CCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGA 
1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 II II 1 1 II II II 1 II 1 1 II 
TAAACACAGTGGAACTGGTTTGCAAACTCTGTGTGCGGCAGGTTGAAGGAGAAGGGCAGA 


2368 


Db 


2358 


2417 


Qy 


2369 


GCTTCAGCATCT^CTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGG 
1 1 1 1 1 1 1 1 II 1 1 1 INI II II 1 1 1 II 1 
TCTTCCAGCTCAACTGTACTGTGTCAGAGGAACCTACTGGCATCGACTTACCTCTCCTGG 


2428 


Db 


2418 


2477 


Qy 


2429 


AGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCA 
1 II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 III 1 
ACCCTGCTAGTACCATCACCACTGTCACCGGACCAAGTGCTTTCAGCATTCCTCTCCCTA 


2488 


Db 


2478 


2537 


Qy 


2489 


TTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGA 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 M III 1 III 1 1 1 M 1 1 1 1 
TCCGGCAGAAGCTATGCAGCAGCCTGGATGCCCCTCAAACAAGAGGCCATGACTGGAGGA 


2548 


Db 


2538 


2597 


Qy 


2549 


CTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCC 
1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 11 1 1 II 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 
TGCTGGCCCATAAACTCAACCTGGACAGGTACTTGAATTACTTTGCCACCAAATCGAGCC 


2608 


Db 


2598 


2657 


Qy 


2609 


CCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCC 
1 11 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II II 1 1 1 1 1 1 
CAACTGGCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATGGAAACCTGAGCA 


2668 


Db 


2658 


2717 


Qy 


2669 


AGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGG 
1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 11 11 1 1 1 1 II 
TGCTGGCAGCCGTCCTGGAAGAAATGGGAAGACATGAGACAGTGGTGTCCTTGGCAGCAG 


2728 


Db 


2718 


2777 



Qy 2729 AGGCTGAGTGCTGA 2742 

II III III 
Db 2778 AAGGACAGTATTGA 2791 



RESULT 3 

AK031655 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 



AK031655 3790 bp mRNA linear HTC 18-SEP-2003 

Mus musculus 13 days embryo male testis cDNA, RIKEN full-length 
enriched library, clone : 6030473H24 product:unc5 homolog (C. 
elegans) 3, full insert sequence. 
AK031655 

AK031655. 1 GI: 26327502 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

1 

Carninci^P. and Hayashizaki^ Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata.Y., Hayatsu,N., Sugahara,Y., Shibata^K., 

Itoh.M., Konno^H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata^K., Itoh,M., 7Vizawa,K., Nagaoka,S., Sasaki, N., Carninci,P., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A., 
Yamamoto,R., Matsumoto, H . , Sakaguchi , S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M. , Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu,M. , Inoue,Y., Kira,A. and Hayashizaki,Y. 
RIKEN integrated sequence analysis (RISA) system — 384-format 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 
4 

The RIKEN Genome Exploration Research Group Phase II Team and the 
FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409, 685-690 (2001) 

5 

The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 3790) 



AUTHORS Adachi,J.^ Aizawa^K., Akimura,T.^ Arakawa^T.^ Bono^H., Carninci,P., 
Fukuda^S., Furuno,M.^ Hanagaki^T., Kara, A. , Hashizume, W. , 
Hayashida^ K. , Hayatsu^N., Hiramoto, K. , Hiraoka,T., Hirozane^T., 
Hori,F.^ Imotani, K. , Ishii,Y., Itoh,M. , Kagawa^I,^ Kasukawa,T., 
Katoh^H.y. Kawai^J., Kojima,Y.^ Kondo^S., Konno^H., Kouda,M., 
Koya^S., Kurihara,C., Matsuyama, T . ^ Miyazaki^A.^ Murata,M., 
Nakamura,M. , Nishi,K»^ Nomura, K., Numazaki, R. , Ohno,M., Ohsato^N., 
Okazaki^Y., Saito,R., Saitoh, H., Sakai^C, Sakai,K., Sakazume^N. ^ 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi , F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res @gsc . riken . go . jp, 
URL :http: //genome. gsc. riken. go. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL: http : // genome . gsc. riken . go . jp/ 
URL:http : //fantom. gsc. riken. go. jp/ . 
FEATURES Location/Qualifiers 
source 1. .3790 

/organism="Mus mus cuius" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_xr e f = " FANTOM_DB : 6 0 3 0 4 7 3H2 4 " 

/db_xref="MGI: 2394911" 

/db_xref="taxon: 10090" 

/clone="6030473H24" 

/sex="male" 

/tissue_type="testis " 

/clone_lib=" RIKEN full-length enriched mouse cDNA library" 
/dev_stage="13 days embryo" 
CDS 118. .2970 

/note= "unnamed protein product; putative 

unc5 homolog (C. elegans) 3 (MGD | MGI : 1095412, 

GB1NM_009472, evidence: BLASTN, 99%, match=464)" 

/codon_start=l 

/protein_id="BAC27495. 1" 

/db_xref="GI : 26327503" 

/translation="MRKGLRATAARCGLGIGYLLQMLVLPALALLSASGTGSAAQDDE 
FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQIYFKCNSEWVHQ 
KDHWDERVDETSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYV 
RIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPAEDRNFYI 
TIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRC 
GRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTH 
WRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQGFIYPISTEHRPQNEYGFSSAPD 
SDDVALYVGIVIAVTVCLAITVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAA 
RQDLLAVPPDLTSAAAMYRGPVYALHDVSDKIPMTNSPILDPLPNLKIKVYNSSGAVT 



PQDDLAEFSSKLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPN 
SGVSLLIPAGAIPQGRVYEMYVTVHRKENMRPPNEDSQTLLTPWSCGPPGALLTRPV 
ILTLHHCADPSTEDWKIQLKNQAVQGQWEDWWGEENFTTPCYIQLDAEACHILTEN 
LSTYALVGQSTTKAAAKRLKLAIFGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQM 
GGQLLEEPKALRFKGSIHNLRLSIHDIAHSLWKSKLLAKYQEIPFYHIWSGSQRNLHC 
TFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVT 
GPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVILDL 
WEAQNFPDGNLSMLAAVLEEMGRHETWYLAAEGQY " 

ORIGIN 

Query Match 33.6%; Score 923.4; DB 11; Length 37 90; 

Best Local Similarity 61.3%; Pred. No. 2.7e-147; 

Matches 1659; Conservative 0; Mismatches 926; Indels 120; Gaps 6; 

Qy 149 TGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACA 208 

III III I I I I I I I I I I II II II I I I I I II II I I I I I I II 
Db 275 TTCCATCTGACCCACCTGAGCCATTGCCACACTTCCTCATTGAGCCCGAGGAAGCTTACA 334 

Qy 209 TCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCT 268 

I I I I I I I I II I I II III M I I I I I I I M I M I I I II II I I I I 

Db 335 TTGTGAAGAACAAGCCTGTGAACCTGTATTGTAAAGCCAGCCCTGCCACCCAGATCTACT 394 

Qy 269 TCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACG 328 

I I I I I I I I I I I I I I II I I I I I III I I I I I I II I III II II 1 I 

Db 395 TCAAGTGCAACAGCGAGTGGGTTCATCAGAAGGACCACGTAGTAGACGAGAGAGTAGATG 454 

Qy 32 9 GGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGA 38 8 

II I I I I I I II I I M I II I I M I I I I I I I I I I I II 

Db 455 AAACCTCTGGTCTAATTGTGAGAGAAGTGAGCATTGAGATTTCACGCCAGCAGGTGGAGG 514 

Qy 389 AGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCA 448 

I I I I I I I I I II II I I I I II I I M I I I I I I I I I M I I I I I I I I II II I 
Db 515 AACTGTTTGGGCCTGAAGATTACTGGTGCCAGTGTGTGGCCTGGAGCTCAGCAGGCACTA 574 

Qy 449 CCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGC 508 

I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I I 

Db 575 CGAAGAGTCGGAAGGCATACGTGCGCATTGCGTATCTGCGGAAGACATTCGAGCAGGAAC 634 

Qy 509 CGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGG 568 

I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 635 CCTTGGGAAAGGAAGTGTCCTTGGAGCAGG/SAGTCTTACTCCAGTGTCGGCCACCTGAAG 694 

Qy 569 GCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGG 628 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 695 GGATCCCAGTGGCTGAGGTAGAATGGCTAAAGAATGAAGACATAATTGATCCTGCTGAAG 754 

Qy 629 ACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTG 688 

I I II I I I I I I I I I I I II I I M I I I I I I I II I I II 

Db 755 ATCGGAACTTTTATATTACTATCGATCACAACCTGATCATCAAGC7VAGCCCGACTCTCAG 814 

Qy 689 ACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCG 74 8 

I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I 

Db 815 ATACAGCAAATTATACCTGTGTTGCCAAAAATATTGTTGCCAAGAGAAAAAGCACCACAG 874 

Qy 749 CTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCA 808 

I I I I I I I I I I I M II II I I I I I I II I I II Mill I I M II II II II I 

Db 875 CCACTGTCATCGTGTATGTTAATGGTGGCTGGTCCACCTGGACAGAGTGGTCTGTGTGTA 934 



Qy 
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Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



809 



935 



869 



995 



929 



GCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCA?VCCCGGCGCCTC 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
ACAGCCGCTGTGGGCGAGGATATCAGAAACGCACAAGAACCTGCACCAACCCAGCCCCAC 



868 



994 



928 



TCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGT 

I I I I II II II I II I II M II M I I I I III II I I I I I III III I II I I 

TCAATGGTGGGGCCTTCTGTGAGGGGCAGAGTGTGCAGAAAATAGCATGCACTACGTTAT 1054 



988 



GCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCA 

I I I I II II II II II II I II I I M I I I I I I I I I I I I I I I I I I I M 
1055 GTCCAGTGGATGGTAGGTGGACTTCATGGAGCAAATGGTCAACCTGTGGGACTGAATGCA 1114 

989 CCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCC 104 8 

I M I I I I II I II I I I I I I II I I I M II I I I I I I I I I I II I I 
1115 CCCACTGGCGCAGGAGGGAGTGTACAGCACCAGCCCCCAAGAACGGGGGTAAGGACTGTG 1174 

1049 AGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAG 1100 

I I I I I I I I I II I I I II II II I I I I III I 
1175 ATGGCCTGGTCCTCCAATCCAAGAACTGCACTGATGGGCTGTGCATGCAGGGATTCATTT 1234 

1101 TGCTTCTGGCC 1111 

I I I I III I 

1235 ACCCCATTTCAACTGAGCACAGACCCCAGAATGAATATGGATTTTCTTCTGCTCCTGACT 1294 

1112 CTGAGGACGTGGCCCTCTATGTGGG CCTCATCGCCGTGGCCGTCTGCCTGGTCCTGC 1168 

I I I I I I II I I I I I I I I II I I I I I M I I I I II I I I I I I I I 

12 95 CAGATGATGTGGCTCTCTACGTGGGGATTGTGATCGCTGTAACAGTCTGTCTGGCGATCA 1354 

1169 TGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGG 1228 

I III II I I I I II I III I I I I I M I I I 

1355 CTGTTGTGGTGGCCCTGTTTGTGTATCGGAAGAACCACCGTGACTTTGAGTCTGACATCA 1414 

1229 CTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAG 12 8 8 

I M I I I M I I I I I II I I I I I I I M I I I M II I III 
1415 TTGACTCCTCAGCACTCAATGGCGGCTTTCAGCCTGTGAACATCAAG GCTG 1465 

1289 ACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGG 134 8 

II I I I I I I I I III I I I I I I I I I I I I I III Mi II 
1466 CCAGACAAGATCTCCTGGCTGTCCCCCCTGACCTCACCTCAGCTGCAGCCATGTACAGGG 1525 

1349 GCAGTCTCTGTCCCCGGCAGGATG GGCCCAGCCCCAAGTTCCAGCTCACCAATGG 1403 

I I I I I I I I I II I I I I II I Mill II I II I II II I 

1526 GACCTGTCTATGCTCTGCATGATGTCTCAGACAA73ATCCCAATGACCAACTCTCCAATTC 1585 

1404 GCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTC 14 63 

I I II II I I I I II I I M II I II II 

1586 TGGACCCACTACCCAACTTGAAAATCAAAGTGTACAACAGCTCAGGTGCTGTCACTCCTC 1645 

1464 TGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCC 1498 

III I II II II Ml II I I Ml 

1646 AGGATGACCTTGCCGAGTTCTCATCCAAACTGTCACCCCAGATGACCCAGTCCTTGCTAG 1705 



1499 



1537 



AGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACA 

Mini I I I I I III III 

1706 AGAATGAGGCCCTTAACCTGAAGAACCAGAGCCTCGCAAGACAGACTGACCCATCCTGCA 1765 



1538 TGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCA 1597 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

17 66 CAGCATTTGGTACCTTCT^CTCTCTTGGGGGTCACCTCATCATTCCTAATTCAGGAGTAA 1825 

1598 GCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGC 1657 

M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I III 
1826 GCTTGCTGATTCCCGCTGGGGCCATTCCTCAGGGGAGAGTCTATGAAATGTATGTGACTG 1885 

1658 TGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCA 1717 

I II II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M 

1886 TACACAGGAAAGAAAATATGAGGCCCCCCATGGAAGACTCTCAGACCCTACTTACCCCTG 1945 

1718 TCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACC 1777 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I 
1946 TGGTGAGCTGTGGGCCTCCTGGAGCTCTGCTGACCCGCCCTGTCATCCTCACTCTGCATC 2005 

1778 ACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGG 1837 

I I I I I I I I I I I I I I I I I Mill II I II I I I I I I I I I III 

2006 ACTGTGCAGACCCCAGCACCGAGGACTGGAAGATCCAGCTCAAAAACCAGGCAGTGCAGG 2065 

1838 GCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCC 1897 

I I I I I I M II II II I I I M I I II II I I I I Ml I 

2066 GACAATGGGAGGATGTTGTGGTGGTTGGGGAGGAGAACTTCACAACCCCCTGTTACATTC 2125 

18 98 AGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGG 1957 

I II I I I I II I I II I I II II I I I I I I I I I II I I I II II II I 

2126 AGCTGGATGCAGAGGCTTGCCATATCCTCACAGAGAACCTCAGTACCTATGCCCTGGTTG 2185 

1958 GAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGG 2017 

I I I II III M II II II II II II II I I I I II I I II I I 

2186 GGCAGTCCACCACCAAAGCAGCTGCCAAGCGTCTTAAACTGGCCATCTTTGGGCCCCTCT 2245 

2 018 CCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCAC 2077 

MM I Mill I II II II II I I I M I II I I M I I I II II II II I I I I I I 

2246 GCTGCTCTTCCCTGGAGTACAGCATTAGAGTCTACTGCCTGGATGACACACAGGATGCCC 2305 

2078 TCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGG 2137 

I M I I I II I I I I II II II M I I M I II II I I II I Mill II 
2306 TGAAGGAAGTTCTACAACTGGAGAGGCAAATGGGAGGACAGCTCCTAGAAGAACCCAAGG 2365 

2138 TCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCT 2197 

Ml M II I II I II II I I I I I I I I I I I I I I I II II I II I 

2366 CTCTTCGTTTTAAAGGCAGCATCCACAACCTGCGCCTGTCTATTCATGACATCGCCCATT 2425 
2198 CCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGA 2257 

II II II I I M M I I I II I I II I II M M II II I M II II I I II I I 
2426 CCCTCTGGAAGAGCAAATTGCTGGCTAAGTATCAGGAAATTCCATTTTACCACATCTGGA 2485 

2258 ATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTA 2317 

II I I I I I I M I I I M I II I I II I I I II I I I I M M I I MM 
2486 GTGGCTCTCAAAGAAACCTCCACTGCACCTTCACTCTGGAAAGACTCAGCCTAAACACAG 2545 

2318 GTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCA 2377 

M MM MM II M III M II M I II II II II II M I MM 

2546 TGGAACTGGTTTGCAAACTCTGTGTGCGGCAGGTTGAAGGAGAAGGGCAGATCTTCCAGC 2605 
2378 TCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAG 2437 



Db 2606 TCAACTGTACTGTGTCAGAGGAACCTACTGGCATCGACTTACCTCTCCTGGACCCTGCTA 2665 

Qy 2438 CGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGA 2497 

Ml I I I I M I I I I I I I M Mil III II II M II I 

Db 2666 GTACCATCACCACTGTCACCGGACCAAGTGCTTTCAGCATTCCTCTCCCTATCCGGCAGA 2725 

Qy 2498 AGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCC 2557 

II II I II I I I II I III I III II M II II I II I M I I 

Db 2726 AGCTATGCAGCAGCCTGGATGCCCCTCAAACTU^GAGGCCATGACTGGAGGATGCTGGCCC 2785 

Qy 2558 AGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCA 2617 

I II M II II I M M II I I II I I II I I II M I I I I I I II M I I 

Db 27 86 ATAAACTCAACCTGGACAGGTACTTGAATTACTTTGCCACCAAATCGAGCCCAACTGGCG 2845 

Qy 2618 TGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTG 2677 

I II I II I II II I II II I I II II I II I M II II I II I I II I II I 

Db 2846 TAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATGGAAACCTGAGCATGCTGGCAG 2905 

Qy 2678 CAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGT 2737 

II II I I I I M I I II I I III I I I I I Ml 

Db 2906 CCGTCCTGGAAGAAATGGGAAGACATGAGACAGTGGTGTACTTGGCAGCAGAAGGACAGT 2965 

Qy 2738 GCTGA 2742 

I I I 

Db 2966 ATTGA 2970 
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AK018177 3866 bp mRNA linear HTC 20-SEP-2003 

Mus musculus adult male medulla oblongata cDNA, RIKEN full-length 
enriched library, clone : 63304 15E02 product : TR7\NSMEMBRANE RECEPTOR 
UNC5H2 homolog [Rattus norvegicus], full insert sequence. 
AK018177 

AK018177.1 GI: 12857775 
HTC; CAP trapper. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 

1 

Carninci^P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,?., Shibata^Y., Hayatsu^N., Sugahara^Y., Shibata,K., 

Itoh,M., Konno^H., Okazaki^Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M. , Aizawa,K., Nagaoka,S., Sasaki, N., Carninci,P., 



Konno,H., Akiyama, J. , Nishi^K,, Kitsunai^T., Tashiro^H., Itoh,M., 
Sumi^N., Ishii,Y., Nakamura, S , , Hazama,M.^ Nishine,T., Harada^A. , 
Yamamoto, R. , Mat sumo t o, H. , Sakaguchi, S . , Ikegami,T.^ Kashiwagi^ K. ^ 
Fujiwake,S., Inoue^K., Togawa,Y,^ Izawa,M. , Ohara,E., Watahiki,M., 
Yoneda^Y.;, Ishikawa^T., Ozawa,K.^ Tanaka,T., Matsuura,S., Kawai^J., 
Okazaki^Y., Muramatsu, M. ^ Inoue,Y., Kira,A. and Hayashizaki, Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 
FANTOM Consortium. 

TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409, 685-690 (2001) 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3866) 

AUTHORS Adachi,J., Aizawa,K., Akahira,S,, Akimura,T., Aral, A. , Aono,H., 
Arakawa,T., Bono,H., Carninci,P., Fukuda,S., Fukunishi, Y. , 
Furuno,M., Hanagaki,T., Kara, A. , Hayatsu,N., Hiramoto,K., 
Hiraoka,T., Hori,F., Imotani,K., Ishii,Y., Itoh,M., Izawa,M., 
Kasukawa,T., Kato,H., Kawai,J., Kojima,Y., Konno,H., Kouda,M. , 
Koya,S., Kurihara,C., Matsuyama , T . , Miyazaki,A., Nishi,K., 
Nomura, K., Numazaki,R., Ohno,M. , Okazaki,Y., Okido,T., Owa,C., 
Saito,H., Saito,R., Sakai,C., Sakai,K., Sano,H., Sasaki, D., 
Shibata,K., Shibata,Y., Shinagawa, A. , Shiraki,T., Sogabe,Y., 
Suzuki, H., Tagami,M. , Tagawa,A., Takahashi, F. , Tanaka,T., 
Tejima,Y., Toya,T., Yamamura,T., Yasunishi,A. , Yoshida,K., 
Yoshino,M., Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted ( 10- JUL-2 000 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res@gsc. riken . go . jp, 
URL :http: //genome. gsc. riken. go. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 

COMMENT Please visit our web site (http://genome.gsc.riken.go.jp/) for 

further details. 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. First strand cDNA was primed with a primer 
[5* GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTVN 3'], cDNA was 
prepared by using trehalose thermo-activated reverse transcriptase 
and subsequently enriched for full-length by cap-trapper. cDNA went 
through one round of normalization to Rot = 10.0 and subtraction to 
Rot = 100.0. Second strand cDNA was prepared with the primer 
adapter of sequence [5' 

GAGAGAGAGATTCTCGAGTTAATTAAATTAATCCCCCCCCCCCCC 3 ' ] . cDNA was cleaved 



FEATURES 

source 



CDS 



with BamHI and Xhol. Vector: a modified pBluescript KS(+) after 
bulk excision from Lambda FLC I. Cloning sites, 5' end: Sail; 3' 
end: BamHI. Host: DHlOB. 

Location/Qualifiers 

1. .3866 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db_xref="FANTOM_DB:6330415E02" 
/db_xref="MGI : 1910526" 
/db_xref="taxon: 10090" 
/clone="6330415E02" 
/sex="male" 

/ 1 i s sue_t ype= "medul 1 a obi ongat a " 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev__stage=" adult" 
417. .3254 

/note="unnamed protein product; TRANSMEMBRANE RECEPTOR 

UNC5H2 homolog [Rattus norvegicus] ( SPTR | 008722 , evidence: 

FASTY, 96.5%ID, 100%length, match=2835) 

putative " 

/codon_start=l 

/protein_id="BAB31108 . 1" 

/db_xref ="GI : 12857776" 

/translation="MRARSGVRSALLLALLLCWDPTPSLAGVDSAGQVLPDSYPSAPA 
EQLPYFLLEPQDAYIVKNKPVELHCRAFPATQIYFKCNGEWVSQNDHVTQESLDEATG 
LRVREVQIEVSRQQVEELFGLEDYWCQCVAWSSSGTTKSRRAYIRIAYLRKNFDQEPL 
AKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVIDPAQDTNFLLTIDHNLIIRQARLS 
DTANYTCVAKNIVAKRRSTAATVIVYVNGGWSSWAEWSPCSNRCGRGWQKRTRTCTNP 
APLNGGAFCEGQAFQKTACTTVCPVDGAWTEWSKWSACSTECAHWRSRECMAPPPQNG 
GRDCSGTLLDSKNCTDGLCVLNQRTLNDPKSHPLETSGDVALYAGLWAVFVWAVLM 
AEGVIVYRRNCRDFDTDITDSSAALTGGFHPVNFKTARPNNPQLLHPSAPPDLTASAG 
lYRGPWALQDSADKIPMTNSPLLDPLPSLKIKVYNSSTIGSGSGLADGADLLGVLPP 
GTYPGDFSRDTHFLHLRSASLGSQHLLGLPRDPSSSVSGTFGCLGGRLSLPGTGVSLL 
VPNGAIPQGKFYDLYLHINKAESTLPLSEGSQTVLSPSVTCGPTGLLLCRPWLTVPH 
CAEVIAGDWIFQLKTQAHQGHWEEWTLDEETLNTPCYCQLEAKSCHILLDQLGSYVF 
MGESYSRSAVKRLQLAIFAPALCTSLEYSLRVYCLEDTPVALKEVLELERTLGGYLVE 
EPKPLLFKDSYHNLRLSLHDIPHAHWRSKLLAKYQEIPFYHVWNGSQRALHCTFTLER 
HSLASTEFTCKVCVRQVEGEGQIFQLHTTLAETPAGSLDALCSAPGNAITTQLGPYAF 
KIPLSIRQKICSSLDAPDSRGNDWRLLAQKLSMDRYLNYFATKASPTGVILDLWEARQ 
QDDGDLNSLASALEEMGKSEMLVAMATDGDC" 



ORIGIN 



Query Match 31.7%; Score 872.4; DB 11; Length 3866; 

Best Local Similarity 60.6%; Pred. No. 1.4e-138; 

Matches 1646; Conservative 0; Mismatches 941; Indels 129; Gaps 



Qy 

Db 



157 GCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAG 216 

III II II I I I I I I I I II I M M I I M I II I I I II M I I I I II I 

540 GCCCCTGCGGAGCAGCTGCCGTACTTCCTATTGGAGCCACAGGACGCCTACATCGTAAAG 599 



Qy 

Db 



217 AACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGC 276 

I I I I M I I M I I M I I II I II I I I I M I I I I II I II I II II I I II I 

600 AACAAGCCAGTGGAACTGCACTGCAGAGCCTTCCCTGCCACGCAGATCTACTTCAAGTGT 659 



Qy 



277 AACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGT 336 
I I I I I I I I I I II I I I II I I I I I I I I I II III I I I I II 



660 AATGGCGAGTGGGTCAGCCAGAATGACCACGTCACACAGGAGAGCCTGGATGAGGCCACA 719 

337 GGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTC 396 

II III II I I I I II I I I I I I I I I I I II II I I I I I I I I 

720 GGCTTGCGGGTGCGAGAGGTGCAGATCGAGGTGTCACGGCAGCAAGTGGAGGAACTCTTC 779 

397 GGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGT 456 

Mill I I I I I II I I I II II I I I I II I II II II I I II I I I I I II M M I I I II M 

780 GGGCTCGAGGACTACTGGTGCCAGTGCGTGGCCTGGAGCTCTTCGGGAACTACCAAGAGT 839 

457 CAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCC 516 

I I I I I I I I M I I I I I III I I I I I M I II I I II II I I I I I I I I I I I I II 

840 CGCCGAGCCTACATCCGCATTGCCTACTTGCGCAAGAACTTTGACCAGGAGCCTCTGGCC 899 

517 AAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCT 576 

II II I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I II I I I I 

900 AAGGAGGTACCCTTGGATCATGAGGTCCTTCTGCAGTGCCGCCCACCGGAGGGAGTGCCT 959 

577 CCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAAT 636 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I II I II 
960 GTGGCTGAGGTGGAATGGCTCAAGAATGAAGATGTCATTGACCCCGCTCAGGACACTAAC 1019 

637 GTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCC 696 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

102 0 TTCCTGCTCACCATTGACCACAACCTCATCATCCGCCAGGCGCGCCTCTCAGACACGGCC 1079 

697 AACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTC 756 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I Mill Ml II I II I III 
1080 AACTACACCTGTGTGGCCAAGAATATCGTGGCCAAGCGCCGGAGCACCGCGGCCACAGTC 1139 

757 ATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGC 816 

II M M I I I II I I I I I I I I II I I I I I I II II M M III I II 
1140 ATCGTCTATGTGAATGGAGGCTGGTCCAGCTGGGCAGAGTGGTCACCCTGTTCCAATCGC 1199 

817 TGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGG 87 6 

I I II I II I I M M I I II I II I I I I I M II I I I I I M II II Mill II 

1200 TGTGGCCGAGGCTGGCAGAAGCGTACTCGGACCTGCACCAATCCAGCCCCACTCAATGGA 1259 

877 GGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTA 936 

II I I I M I M II II I I III II M I II II II I III Mill I II I II I M I 

1260 GGCGCCTTCTGTGAGGGACAGGCCTTCCAGAAGACAGCTTGCACCACCGTGTGCCCAGTG 1319 

937 GACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGG 996 

MM II II I II I II II II I II I M I M II I MM I I I M I I 

1320 GATGGAGCGTGGACCGAGTGGAGCAAGTGGTCTGCCTGCAGCACAGAGTGTGCGCACTGG 1379 
997 CGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACT 1056 

I I M M I I I M II I II I M I II I II I II M II II I MM 

1380 CGCAGCCGCGAGTGCATGGCACCGCCACCCCAGAACGGAGGCCGTGACTGCAGCGGGACG 1439 

1057 GACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCC 1111 

M I II II II II I I I M I I II I I I II I 
1440 CTACTTGACTCCAAGAACTGCACTGATGGGCTGTGCGTGCTGAATCAGAGAACTCTAAAC 1499 

3^112 CTGAGGACGTGGCCCTCTATGTGGGCCTCATC 1143 

II M I I II I M I I I Mill I 

1500 GACCCTAAAAGCCACCCCCTGGAGACATCGGGAGATGTGGCACTGTACGCAGGCCTTGTG 1559 



Qy 1144 GCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTC— CTCATCCTCGTTTATTGCCGG 1200 

I I I I I I I I I I I M I I I I I I I I I I I I 

Db 1560 GTGGCCGTCTTTGTGGTGGTAGCGGTTCTCATGGCCGAGGGAGTGATCGTATACCGGAGA 1619 

Qy 1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGT CCATTCTCACCTCAGGCTTC 1257 

II M I I I I I I I I I I M I I I I Mill I I II I I 

Db 1620 AACTGCCGGGACTTCGACACGGACATCACCGACTCCTCTGCGGCCCTCACTGGTGGCTTC 167 9 

Qy 1258 CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCT CACCATC 1311 

II II II II I II I II I I I I I I M I I II I I I I II I 

Db 1680 CACCCTGTCAACTTCAAGACTGCAAGGCCCAAGAACCCGCAGCTCCTGCACCCGTCCGCC 1739 

Qy 1312 CAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA- 137 0 

I II II II I I III I I I I I I I I II I I I I II I I I I I I I 

Db 1740 CCTCCAGACCTAACGGCCAGTGCTGGCATCTACCGCGGGCCTGTGTATGCCCTGCAGGAC 1799 

Qy 1371 TGGGCCCAGCCCCAAG 1386 

I I I I I I I I I I I I 

Db 1800 TCCGCCGACAAGATCCCCATGACTAATTCGCCCCTGCTGGATCCCCTGCCCAGCCTCAAG 1859 

Qy 1387 TTCCAGCTCACCAATGGGCACCTGCTCAG C 1416 

I I I I I I I I I I I I I 

Db 1860 ATCAAGGTCTATAACTCCAGCACCATCGGTTCTGGGTCTGGCCTGGCTGATGGAGCCGAC 1919 

Qy 1417 CCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAG 1476 

I I I I I I I I I M I I I 11 II I III II III 

Db 1920 CTGCTGGGTGTCCTCCCGCCGGGCACGTACCCAGGCGATTTCTCCCGGGACACCCATTTC 1979 

Qy 1477 TTCGTCTCCCGCCTCTCCACCC AGAACTACTTCCGCTCCCTGCCCCGAGGCACC 1530 

I I III I I I I I I I I I I I M I I I I I I I I I 

Db 1980 CTGCACCTGCGCAGTGCCAGCCTTGGTTCCCAGCACCTCCTGGGCCTACCTCGGGACCCC 2 039 

Qy 1531 AGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACA 1590 

M I I II I I I I I I I I I I I I I I I I M I I I I II I I III 

Db 2040 AGCAGCAGTGTCAGCGGCACCTTTGGTTGCCTGGGAGGAAGGCTGAGCCTCCCCGGCACA 2099 

Qy 1591 GGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTAC 1650 

II I I I I I M I III I I I I I I I I I I I I I I I I I I I I I I III 

Db 2100 GGGGTCAGCCTGTTGGTACCAAATGGAGCCATTCCCCAGGGCAAGTTCTATGACCTGTAT 2159 

Qy 1651 CTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTG 1710 

II I I I I I I I I I I I I I I M I I I I I I I I I 

Db 2160 CTACATATCAACAAGGCCGAAAGCACCCTCCCACTTTCAGAAGGTTCCCAGACAGTATTG 2219 

Qy 1711 AGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCT 177 0 

I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I II I I 

Db 2220 AGCCCCTCGGTGACCTGTGGGCCCACAGGCCTACTCCTGTGCCGCCCTGTCGTCCTCACC 227 9 

Qy 1771 ATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCG 1830 

II 1 1 1 1 1 1 1 1 II 1 1 I 1 1 1 1 1 1 1 1 I I I mill I I I I 

Db 2280 GTGCCCCACTGTGCTGAAGTCATCGCTGGAGACTGGATCTTTCAGCTCAAGACCCAGGCC 2339 

Qy 1831 TGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTAC 1890 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I INI 

Db 2340 CATCAGGGCCACTGGGAGGAGGTGGTGACCTTGGATGAGGAGACCCTCAACACACCCTGC 2399 



Qy 


1891 


TACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
TACTGCCAGCTGGAGGCTAAGTCCTGCCACATCCTGCTGGACCAGCTGGGTTCCTACGTA 


1950 


Db 


2400 


2459 


Qy 


1951 


CTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCG 

1 1 1 1 I 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TTCATGGGCGAGTCCTACTCTCGCTCTGCAGTCAAGCGGCTCCAGCTGGCCATCTTCGCC 


2010 


Db 


2460 


2519 


Qy 


2011 


CCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCAC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 

CCAGCCCTCTGCACCTCCCTGGAGTATAGCCTCAGGGTCTACTGTCTGGAGGACACACCT 


2070 


Db 


2520 


2579 


Qy 


2071 


GATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAG 

1 1 II II II II II II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 III 1 1 1 M 
GTAGCACTGAAGGAGGTCCTGGAGCTGGAGAGGACTCTGGGTGGCTACTTGGTGGAGGAG 


2130 


Db 


2580 


2639 


Qy 


2131 


CCACGGGTCCTGCACTTCT^GGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTG 

II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 
CCCAAGCCTTTGCTCTTTAAGGACAGTTACCACAACCTACGCCTCTCCCTCCATGACATC 


2190 


Db 


2640 


2699 


Qy 


2191 


CCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCAC 

III III 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 II 1 
CCCCATGCCCACTGGAGGAGCAAACTACTGGCCAAGTACCAGGAGATTCCCTTCTACCAC 


2250 


Db 


2700 


2759 


Qy 


2251 


ATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCC 

1 M 1 1 1 II 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 
GTCTGGAATGGCAGCCAGAGAGCCCTGCACTGCACTTTCACCCTGGAGAGGCATAGCCTG 


2310 


Db 


2760 


2819 


Qy 


2311 


AGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGC 

III II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 
GCCTCCACGGAGTTCACCTGTAAGGTCTGCGTGCGGCAGGTCGAAGGGGAAGGCCAGATT 


2370 


Db 


2820 


2879 


Qy 


2371 


TTCAGCATCAACTTCAACATCACC AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTG 

III III 1 III II III II II 
TTCCAGCTGCACACAACGTTGGCCGAGACGCCTGCTGGCTCCCTGGATGCTCTCTGCTCT 


2427 


Db 


2880 


2939 


Qy 


2428 


GAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTC 

1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 
GCCCCGGGCAATGCCATCACCACCCAGCTGGGACCCTATGCCTTCAAGATACCCCTGTCC 


2487 


Db 


2940 


2999 


Qy 


2488 


ATTCGGCAGAAGAT7UVTTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGG 

II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 
ATCCGCCAAAAGATCTGCAGCAGCCTGGACGCCCCCGACTCCCGGGGCAACGACTGGAGG 


2547 


Db 


3000 


3059 


Qy 


2548 


ACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGC 

1 11 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 III 
CTGTTGGCGCAGAAGCTGTCCATGGACCGGTACCTAAACTACTTCGCCACCAAAGCTAGT 


2607 


Db 


3060 


3119 


Qy 


2608 


CCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGC 

II II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 
CCCACAGGTGTCATCTTAGACCTCTGGGAAGCTCGGCAACAGGATGACGGGGACCTCAAC 


2667 


Db 


3120 


3179 


Qy 


2668 


CAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCG 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 Ml 1 
AGCCTGGCCAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGGTAGCCATGGCCACA 


2727 


Db 


3180 


3239 


Qy 


2728 


GAGGCTGAGTGCTGAG 2743 





Ill II 1 1 II I II 

Db 3240 GATGGCGATTGCTGAG 3255 
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1. .2802 

/organism="Pan troglodytes" 

/mol_type=" genomic DNA" 

/db_xref="taxon:9598" 

<1. .>2802 

/gene="UNC5C" 

/locus__tag="HCM2575" 



ORIGIN 



Query Match 29.6%; 
Best Local Similarity 53.8%; 
Matches 1477; Conservative 



Score 814; DB 29; 
Pred. No. 1.2e-128; 
0; Mismatches 1200; 



Length 28 02; 
Indels 69; Gaps 



6; 



Qy 

Db 

Qy 

Db 



57 GCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGG 116 

I I I I II I I MM II II I I I I I I I Ml I 

66 GCTCGTGCTACCTGCCCTGGCCCTGCTCAGCGCCAGCGGCACTGGCTCCGCCGCCCAAGA 125 

117 TGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCC 17 6 

M I I I I I I I I M I II I I I I I M 

126 TGATGACTTTTTTCATGAACTCCCAGAT^ACTTTTCCTTCGGATCCACCTGAGCCTCTGCC 185 



Qy 177 CCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGT 236 

I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II 

Db 186 ACATTTCCTTATTGAACCTGAGGAAGCTTATATTGTGAAGAATAAGCCTGTGAACCTGTA 245 

Qy 237 GTGCAAGGCC GTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGT 290 

II II II I 

Db 246 CTGTAAAGCAAGCCCTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 305 

Qy 291 GCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCAT 350 

Db 306 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 365 

Qy 351 GGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATA 410 

Db 366 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 425 

Qy 411 CTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACAT 470 

Db 426 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 485 

Qy 471 CCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCT 530 

I I II I I M II M I I I I I I I I I I I I I I I I I I I I 

Db 4 86 NNNNNNNNNNNNNNNNCGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTT 545 

Qy 531 GGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGA 590 

II I I I I I I I I I I I I I M I I I I I I I I I I I II I Mill 

Db 546 GGAACAGGAAGTCTTACTCCAGTGTCGGCCACCTGAAGGGATCCCAGTGGCTGAGNNNNN 605 

Qy 591 GTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCG 650 

I I I I I I I I INN III I M I M I I I I 

Db 606 NNNNNNGA7VAAATGAAGACATAATTGATCCCGTTGAAGATCGGAATTTTTATATTACTAT 665 

Qy 651 GGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGT 710 

I I I I I I II I I I I I I M I I I I I I II I I I I I I I I I I I II I I I 

Db 666 TGATCACAACCTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAATTATACCTGTGT 725 

Qy 711 GGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAA 770 

I M I I I I I I I II I I I II I III M I I I II I I I I I II I I 

Db 726 TGCCAAAAACATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATGTCAA 785 

Qy 771 CGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTG 830 

I I I I II I I I I I II I I I II I II I I II I II II I I I II I I I I I II II I 

Db 7 86 CGGTGGCTGGTCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTA 845 

Qy 831 GCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGA 890 

M I I I I I I I III I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I 

Db 846 TCAGAAACGTACAAGGACTTGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTGA 905 

Qy 891 GGGGCAGAATGTCCAGATW^CAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAG 950 

I I I I M I I I I I I I I M I I I I I I I III I M I I I I I I I I I I I I I I I I I 

Db 906 AGGGCAGAGTGTGCAGAAAATAGCCTGTACTACGTTATGCCCAGTGGATGGCAGGTGGAC 965 

Qy 951 CCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTG 1010 

II I I I II I I I II II I I I I I I I I II I I I I I I I I I I II II II I I I I I I 

Db 966 GCCATGGAGCAAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGCAGGAGGGAGTG 1025 



Qy 1011 CTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCG 1070 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



III II I II III I I I I I M I I II I I I I I II Mill II 

1026 CACGGCGCCAGCCCCCAAGAATGGAGGCAAGGACTGCGATGGCCTCGTCTTGC7\ATCCAA 1085 
1071 CAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTA 1130 

II III M M II I I I M I I II I M I I I I I I I I M II I I I 

1086 GAACTGCACTGATGGGCTTTGCATGCAGACTGCTCCTGATTCAGATGATGTTGCTCTCTA 1145 

1131 TGTGGG CCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCT 1187 

I II I I I I I M I I I I I I I I M II I I I I I I I II I 
1146 TGTTGGGATTGTGATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTGGCCTTGTT 1205 

1188 CGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCAC 1247 

I I I I I I III I I I M I I M II I I I II II M I I I I 

1206 TGTGTATCGGAAGAATCATCGTGACTTTGAGTCAGATATTATTGACTCTTCGGCACTCAA 12 65 

1248 CTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCAC 1307 

I I I I I I I I I I I I I M I I II I MM I I II I II II I 

1266 TGGGGGCTTTCAGCCTGTGAACATCAAG GCAGCAAGACAAGATCTGCTGGC 1316 

1308 CATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCA 1367 

II II I I I II I I I I I III III M I II I I II I I II 
1317 TGTACCCCCAGACCTCACGTCAGCTGCAGCCATGTACAGAGGACCTGTCTATGCCCTGCA 137 6 

1368 GGATG GGCCCAGCCCCAAGTTCCAGCTCACCAAT GGGCACCTGCTCAGCCC 1418 

III II I II II I I M II I I II I I I I I I II I I I I 

1377 TGACGTCTCAGACAAAATCCCAATGACCAACTCTCCAATTCTGGATCCACTGCCCAACCT 1436 

1419 CCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTT 1478 

I I II II I I I II III I Mill 

1437 GAAAATCA7\AGTGTACAACACCTCAGGTGCTGTCACCCCCCAAGATGACCTCTCTGAGTT 14 96 

1479 CGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACAT 1538 

III II II I I I I I I I I II I II I I I I 

1497 TACGTCCAAGCTGTCCCCTCAGATGACCCAGTCATTGTTGGAGAATGAAGCCCTCAGCCT 1556 

1539 GA CCTATGGGACCTTCAA 1556 

II I I I II I I II I 

1557 GAAGAACCAGAGTCTAGCAAGGCAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCNN 1616 

1557 CTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGA 1616 

I I I II I I I I I I I I I I I I I I I I I II I I I I I I I II I I 

1617 NTNGCTGGGNNNNCACCTTATTGTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGG 167 6 

1617 TGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGT 167 6 

II II I II I I I II II M II II I I I I III I M II I III I 

1677 GGCCATTCCCCAAGGGAGAGTCTACGAAATGTATGTGACTGTACACAGGAAAGAAACTAT 1736 
1677 GAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCC 1736 

II II II I I I M M II M II I I III II I II M M I II I I II I I 

1737 GAGGCCACCCATGGATGACTCTCAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCCC 1796 
1737 TGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCC 17 96 

Ml II II II II II I II II I M II I II II I II I II I II I I II I 

1797 AGGAGCTCTGCTCACCCGCCCCGTCGTCCTTACTATGCATCACTGCGCAGACCCCAATAC 1856 



1797 TGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCT 
II II I I I II II M I I I I I I I 11 II I I I I I I I I I M I I 



1856 



Db 1857 CGAGGACTGGAAAATACTGCTCAAGAACCAGGCAGCACAGGGACAGTGGGAGGATGTGGT 1916 

Qy 1857 GCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTG 1916 

I I I I I I I I I III I I I I I I I I I I I I I I I I II I I I I I 

Db 1917 GGTGGTCGGGGAGGAAAACTTCACCACCCCCTGCTACATTCAGCTGGATGCAGAGGCCTG 1976 

Qy 1917 CTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGC 1976 

I I I M I I I I I I I I I I II II I II II I I I I I I I II III II 

Db 1977 CCACATCCTCACAGAG7VACCTCAGCACCTACGCCCTGGTAGGACATTCCACCACCAAAGC 2036 

Qy 1977 TGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTA 2036 

I I I I II I I I II I I II II I I I I I I I I I I M I I M I I I I I I I I I I I 

Db 2037 GGCTGCGAAGCGCCTCAAGCTGGCCATCTTTGGGCCCCTGTGCTGCTCCTCGCTGGAGTA 2096 

Qy 2037 CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCT 2096 

II I I II M I I I M I I I II I I I I I I I I I I I II II I I I I I II I I I I I I I 

Db 2 097 CAGCATCCGAGTCTACTGTCTGGATGACACCCAGGATGCCCTGAAGGAAATTTTACATCT 2156 

Qy 2097 GGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAG 2156 

MM I II II I I II I II II I I I I I II II I I II M II I I II 
Db 2157 TGAGAGACAGATGGGAGGACAGCTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGGCAG 2216 

Qy 2157 TTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCT 2216 

M II II II I M I M I I I M I I M II I II I II I I II I I I I II II I 

Db 2217 CACCCACAACCTGCGCCTGTCAATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATT 2276 

Qy 2217 CCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTT 2276 

II I I II I I I I I I 

Db 2277 GCTGGCTAAATATCAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNT^CCT 2336 

Qy 2277 GCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCT 2336 

M M I II I I I I I I I I II I I I I I I I II I I II II II II M I M I I 

Db 2337 GCACTGCACCTTCACTCTGGAAAGATTTAGCCTGAACACAGTGGAGCTGGTTTGCAAACT 2396 

Qy 2337 GTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAA 2396 

I I M I II II II I I II II I I II II I I I M I I M I I I I M I I I I 

Db 2397 CTGTGTGCGGCAGGTGGAAGGAGAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGA 2456 

Qy 2397 GGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGT 2456 

I II I I II II I I I I I I I II I I I I I 

Db 2457 GGAACCTACTGGCATCGATTTGCCGCTGCTGGATCCTGCGAACACCATCACCACGGTCAC 2516 

Qy 2457 GGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGA 2516 

I II I II I I I I I I II I Mill III I I II I II II M I I I II I II I I I 
Db 2517 GGGGCCCAGTGCTTTCAGCATCCCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGA 2576 

Qy 2517 CCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAG 2576 

I I I I I I I I I II I I II II I II I I I I I I I I I 

Db 2577 TGCCCCCCAGACGAGAGGCCATGACTGGAGGATGCTGGCCCATAAGCNNNNNNNNNNNNN 2636 

Qy 2577 CCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGA 2636 

I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I 
Db 2637 GTACTTGAATTACTTTGCCACCAAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGA 2696 

Qy 2637 GGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGG 2696 

I I I I I I I I I M I I I I I I I I I M I I I I I I I I I Ml I I I I I I 

Db 2697 AGCACAGAACTTCCCAGATGGAAACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGG 2756 



Qy 2697 CCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I Mill I II I III I I 
Db 2757 AAGACATGAAACGGTGGTGTCCTTAGCAGCAGAAGGGCAGTATTAA 2802 
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AY411747 2532 bp DNA linear GSS 12-DEC-2003 

Homo sapiens HCM4327 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence, 
AY411747 

AY411747.1 GI: 39767715 
GSS. 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 2532) 

Clark, A. G., Glanowski, S . , Nielson, R. , Thomas, P., Ke j ariwal , A. , 
Todd, M. A., Tanenbaum, D.M, , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J . J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human -chimp -mouse orthologous 

gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 2532) 

Clark,A.G., Glanowski, S . , Nielson, R., Thomas,?., Kej ariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J . J. , 
Adams, M.D. and Cargill,M. 

Direct Submission 

Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and ordering 
them based on alignment. 

Location/Qualifiers 

1. .2532 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon:9606" 
<1. .>2532 

/locus_tag="HCM4327" 



Query Match 29.4%; 

Best Local Similarity 61.1%; 
Matches 154 0; Conservative 



Score 810.4; DB 29; 

Pred. No. 4.6e-128; 
0; Mismatches 851; 



Length 2532; 

Indels 129; Gaps 



9; 



Qy 

Db 



352 GAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATAC 411 

I I I I I I II I II II I I I I I I I II I I II I I I M I II II II II II I I I 
13 GAGGTGCAGATCGAGGTGTCGCGGCAGCAGGTGGAGGAGCTCTTTGGGCTGGAGGATTAC 72 



Qy 

Db 



412 TGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATC 471 

II I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I II I I I I I I I I I II II 

73 TGGTGCCAGTGCGTGGCCTGGAGCTCCGCGGGCACCACCAAGAGTCGCCGAGCCTACGTC 132 



Qy 472 CGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTG 531 

Mill III I I II I II I II I I I M I I I I I I I I I II II I I I I I I I I I I I I I I I 

Db 133 CGCATCGCCTACCTGCGCAAGAACTTCGATCAGGAGCCTCTGGGCAAGGAGGTGCCCCTG 192 

Qy 532 GAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAG 591 

I I I I I I I I I I I II I I I II I I I M I I I I III I I I I I I I M M 

Db 193 GACCATGAGGTTCTCCTGCAGTGCCGCCCGCCGGAGGGGGTGCCTGTGGCCGAGGTGGAA 252 

Qy 592 TGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGG 651 

I I I I I I I M I I I I I I I I I II I I I I I II I I I I I I I I I I 

Db 253 TGGCTCAAGAATGAGGATGTCATCGACCCCACCCAGGACACCAACTTCCTGCTCACCATC 312 

Qy 652 GAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTG 711 

I I II II III I I M I I I II M I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 313 GACCACAACCTCATCATCCGCCAGGCCCGCCTGTCGGACACTGCCAACTATACCTGCGTG 372 

Qy 712 GCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAAC 771 

I I I I I I II I M M I I I I I I I I I I I I I I I I I I II I I I I I I II II I I I I I 

Db 373 GCC/iAGAACATCGTGGCCAAACGCCGGAGCACCACTGCCACCGTCATCGTCTACGTGAAT 432 

Qy 772 GGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGG 831 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I 

Db 433 GGCGGCTGGTCCAGCTGGGCAGAGTGGTCACCCTGCTCCAACCGCTGTGGCCGAGGCTGG 492 

Qy 832 CAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAG 891 

Mill II I Mill II II I I I I I I I I M II I M II I II II II I II II III 

Db 493 CAG7\AGCGCACCCGGACCTGCACCAACCCCGCTCCACTCAACGGAGGGGCCTTCTGCGAG 552 

Qy 892 GGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGC 951 

II III MIIMI II IIIIM Mill I llllllll II II MM 

Db 553 GGCCAGGCATTCCAGAAGACCGCCTGCACCACCATCTGCCCAGTCGATGGGGCGTGGACG 612 

Qy 952 CCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGC 1011 

II II II II I II M I I Mill I II II I I I II I M M II II I II M II 

Db 613 GAGTGGAGCAAGTGGTCAGCCTGCAGCACTGAGTGTGCCCACTGGCGTAGCCGCGAGTGC 672 

Qy 1012 TCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGC 1071 

I II MUM I I M I II I Mill MM M II I I 

Db 673 ATGGCGCCCCCACCCCAGAACGGAGGCCGTGACTGCAGCGGGACGCTGCTCGACTCTAAG 732 

Qy 1072 AACTGTACCAGTGACCTCTGTGTACACAGTGCTTCT 1107 

II II I II M I I I I Mill 

Db 733 AACTGCACAGATGGGCTGTGCATGCAAAATAAGAAAACTCTAAGCGACCCCAACAGCCAC 792 

Qy 1108 GGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC ATCGCCGTGGCCGTC 1155 

I I I I II I I II I II II II I I I I II I II I I Ml 

Db 793 CTGCTGGAGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTG 852 

Qy 1156 TGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTG 1215 

I II I Mill II I II II I II M I I I I 

Db 853 GTCGTGGCAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTC 912 

Qy 1216 GACTCAGATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATC 1272 

II II II I I II I I II M I I MM I II M M II II II II I 

Db 913 GACACAGACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTT 972 



Qy 1273 AAGCCCAGCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGC 1326 

I I I I I I M I M I I I I I I I II II II I II I I I 

Db 973 AAGACGGCAAGGCCCAGCAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACA 1032 

Qy 1327 ACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGG 1374 

III I I I II I I I I I II I I I I I I I I I I II 

Db 1033 GCCAGCGCCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATC 1092 

Qy 1375 CCCAGCCCCAAGTTCCAGCTCACCA — 1399 

II I II I I II I I I II M II 

Db 1093 CCCATGACCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGC 1152 

Qy 1400 ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCC ACACA 1440 

I II II I I III II I I I I I I I I 

Db 1153 TCCAGCACCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTG 1212 

Qy 1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

III I I I M I I I I II I I I I I I I I I I I 

Db 1213 CCGCCTGGCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGC 1272 

Qy 1501 AACTACTTCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I II I I I I I II I I I I I I I I I I I I I I 

Db 1273 GCCAGCCTCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGC 1332 

Qy 154 6 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

II I M I I I I I I I I I I I I I II I I II I I Mill I I I M I I I 

Db 1333 GGCACCTTTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTG 1392 

Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I 

Db 1393 GTGCCCAATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAG 1452 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

I I II II II II I I II I I I I M I I I I I I I 

Db 1453 GCAGAAAGTACCCTCCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACC 1512 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 1513 TGTGGACCCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCC 1572 

Qy 17 8 6 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

II III I I I I I I I I I I I I I I I I I I I I I I I I I MM 

Db 1573 GAAGTCAGTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGG 1632 

Qy 1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I M I M I I Mill II II II I I I I I I I II II I I I I I I I I I M 

Db 1633 GAGGAGGTGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAG 1692 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

II II III II Mill M II II II II I II I 1 II I II I M 

Db 1693 CCCAGGGCCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCC 1752 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

I I II I I II II I I II I I II I I M II II I I I M I II 

Db 1753 TATTCCCGCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACC 1812 

Qy 2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 



Db 


1813 


TCCCTGGAGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAG 


1872 


Qy 


2086 


GTGGTGCAGCTGGAG7\AGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III II II 1 M II 

GTGCTGGAGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATG 


2145 


Db 


1873 


1932 


Qy 


2146 


TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 

II II M II M 1 II II II 1 1 1 II 1 1 1 1 1 1 1 1 M II II II 1 1 II III III 
TTCAAGGACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGG 


2205 


Db 


1933 


1992 


Qy 


2206 


AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 

1 1 II 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

AGGAGCAAGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGC 


2265 


Db 


1993 


2052 


Qy 


2266 


CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 

ill 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 III 1 1 1 1 
CAGAAGGCCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTC 


2325 


Db 


2053 


2112 


Qy 


2326 


GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 III 1 1 1 
ACCTGCAAGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACC 


2385 


Db 


2113 


2172 


Qy 


2386 


AACATCACCAAG GACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 

1 1 1 1 1 III II II 1 II 
ACTCTGGCAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACT 


2442 


Db 


2173 


2232 


Qy 


2443 


GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 

1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTCACCACCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATA 


2502 


Db 


2233 


2292 


Qy 


2503 


ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 

II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 
TGCAACAGCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAG 


2562 


Db 


2293 


2352 


Qy 


2563 


CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 

Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 

CTCTCTATGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATC 


2622 


Db 


2353 


2412 


Ov 


2623 


CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 

II M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1. II 
CTGGACCTCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCC 


2682 


Db 


2413 


2472 


Qy 


2683 


GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TTGGAGGAGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 


2742 


Db 


2473 


2532 
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Mus musculus HCM4327 gene, VIRTUAL TRANSCRIPT, partial sequence, 
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AY411749 

AY4 11749.1 GI: 39767717 
GSS. 

Mus musculus (house mouse) 



ORGANISM Mus itius cuius 
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Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
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FEATURES Location/Qualifiers 
source 1. .2532 

/organism="Mus musculus" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 10090" 
gene <1 . • >2532 

/locus_tag="HCM4327" 

ORIGIN 



Query Match 28.4%; 
Best Local Similarity 60.1%; 
Matches 1515; Conservative 



Score 780.4; DB 29; 
Pred. No. 6.3e-123; 
0; Mismatches 876; 



Length 2532; 
Indels 129; Gaps 



8; 



Qy 


352 


Db 


13 


Qy 


412 


Db 


73 


Qy 


472 


Db 


133 


Qy 


532 


Db 


193 


Qy 


592 


Db 


253 



1 1 1 



II I 1 1 1 1 1 1 1 1 1 1 



II III 



I II M I M I I I II I 



TGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATC 

II M II II II II II II I I I I M I I I I I I I I II I M I I I II I I I I I II I M I 

TGGTGCCAGTGCGTGGCCTGGAGCTCTTCGGGAACTACCAAGAGTCGCCGAGCCTACATC 

CGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTG 

I I I I I III I I I I I I I II II I II II I I I I I I II I I I II I I II I I I I I M II 

CGCATTGCCTACTTGCGCAAGAACTTTGACCAGGAGCCTCTGGCCAAGGAGGTACCCTTG 



471 



132 



531 



192 



591 



GAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAG 

I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I 
GATCATGAGGTCCTTCTGCAGTGCCGCCCACCGGAGGGAGTGCCTGTGGCTGAGGTGGAA 252 



TGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGG 

II I I I I I M II II I I I I I I I I I I I I I I II 1 MM 

TGGCTCAAGAATGAAGATGTCATTGACCCCGCTCAGGACACTAACTTCCTGCTCACCATT 



651 



312 



Qy 



652 GAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTG 711 
I I II II II I I I II II II I I I M I I I M II M II II II I II II II II I 



313 GACCACAACCTCATCATCCGCCAGGCGCGCCTCTCAGACACGGCCAACTACACCTGTGTG 372 



712 GCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAAC 771 

IIIMIM IMIIIII II M I II I Ill Mill 

373 GCCAAGAATATCGTGGCCAAGCGCCGGAGCACCACGGCCACAGTCATCGTCTATGTGAAT 432 

772 GGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGG 831 

M II I I II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I 

433 GGAGGCTGGTCCAGCTGGGCAGAGTGGTCACCCTGTTCCAATCGCTGTGGCCGAGGCTGG 492 

832 CAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAG 891 

Mill II I MM Mlllllll II II II Mill II Mill lllllllll 

493 CAGAAGCGTACTCGGACCTGCACCAATCCAGCCCCACTCAATGGAGGCGCCTTCTGTGAG 552 

892 GGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGC 951 

II III I I II I I I Mill III I I I I I I I II I I II I I II II I II I I 

553 GGACAGGCCTTCCAGAAGACAGCTTGCACCACCGTGTGCCCAGTGGATGGAGCGTGGACC 612 

952 CCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGC 1011 

I I I I I I I I I I I Ill I I I I I I I I I I II I I I I I I II 

613 GAGTGGAGCAAGTGGTCTGCCTGCAGCACAGAGTGTGCGCACTGGCGCAGCCGCGAGTGC 672 

1012 TCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGC 1071 

I II I I I I II IMIIIII II I II II I I I I M I I I 

67 3 ATGGCACCGCCACCCCAGAACGGAGGCCGTGACTGCAGCGGGACGCTACTTGACTCCAAG 732 

1072 AACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGAC 1119 

I II II II II I I M I I I II I I I I I 

733 AACTGCACTGATGGGCTGTGCGTGCTGAATCAGAGAACTCTAAACGACCCTAAAAGCCAC 792 

1120 GTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGC 1158 

II I II I I I I I Mill II I III 
793 CCCCTGGAGACATCGGGAGATGTGGCACTGTACGCAGGCCTTGTGGTGGCCGTCTTTGTG 852 

1159 CTGGTCCTGCTGCTGCTTGTCCT CATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTG 1215 

II II I I I I I I I I I II I I I I I I I I M I 

853 GTGGTAGCGGTTCTCATGGCCGTGGGAGTGATCGTATACCGGAGAAACTGCCGGGACTTC 912 

1216 GACTCAGATGTGGCTGACTCGT CCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATC 1272 

M I I I I I I I I I II I I I I II I II I I M II I I I 1 I I I I I 

913 GACACGGACATCACCGACTCCTCTGCGGCCCTCACTGGTGGCTTCCACCCTGTCAACTTC 972 

1273 AAGCCCAGC7WVGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGC 1326 

Mil I I I I I I I I I M I I I I M II IMIIIII 

973 AAGACTGCAAGGCCCAACAACCCGCAGCTCCTGCACCCGTCCGCCCCTCCAGACCTAACG 1032 

1327 ACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGG 1373 

Ml I II II I II M I I I I II I II II II 

1033 GCCAGTGCTGGCATCTACCGCGGGCCTGTGTATGCCCTGCAGGACTCCGCCGACAAGATC 1092 

1374 GCCCAGCCCCAAGTTCCAGCTCACCAAT 1401 

I II I I I I I I I I I II II II M 
1093 CCCATGACTAATTCGCCCCTGCTGGATCCCCTGCCCAGCCTCAAGATCAAGGTCTATAAC 1152 

1402 GGGCACCTGCTCAG CCCCCTGGGTGGCGGC 1431 

I II I II I II I II I I I 

1153 TCCAGCACCATCGGTTCTGGGTCTGGCCTGGCTGATGGAGCCGACCTGCTGGGTGTCCTC 1212 



Qy 


1432 


CGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTC 


1491 


Db 


1213 


II 1 1 1 1 II 1 1 1 1 Mill 1 1 III 
CCGCCGGGCACGTACCCAGGCGATTTCTCCCGGGACACCCATTTCCTGCACCTGCGCAGT 


1272 


Qy 


1492 


TCCACCCAGAACT ACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 

1 1 1 1 1 1 Mill 1 M M M 1 1 1 1 M M II 1 

GCCAGCCTTGGTTCCCAGCACCTCCTGGGCCTACCTCGGGACCCCAGCAGCAGTGTCAGC 


1545 


Db 


1273 


1332 


Qy 


1546 


GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 

M M M 1 1 M 1 M M 1 M II 1 II 1 1 II 1 II M II M 1 1 

GGCACCTTTGGTTGCCTGGGAGGAAGGCTGAGCCTCCCCGGCACAGGGGTCAGCCTGTTG 


1605 


Db 


1333 


1392 


Qy 


1606 


ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 

III 1 II II 1 II II II 1 M II II 1 1 1 1 II M 1 II 1 1 1 

GTACCAAATGGAGCCATTCCCCAGGGCAAGTTCTATGACCTGTATCTACATATCAACAAG 


1665 


Db 


1393 


1452 


Qy 


1666 


CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 

1 1 II 1 1 II Ml Mill 1 II II II 1 II 1 1 
GCCGAAAGCACCCTCCCACTTTCAGAAGGTTCCCAGACAGTATTGAGCCCCTCGGTGACC 


1725 


Db 


1453 


1512 


Qy 


1726 


TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 

1 II II 1 II 1 M 1 1 M II II 1 II II 1 1 1 II 1 II M 1 1 1 II 1 
TGTGGGCCCACAGGCCTACTCCTGTGCCGCCCTGTCGTCCTCACCGTGCCCCACTGTGCT 


1785 


Db 


1513 


1572 


Qy 


1786 


GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 

M Mini 1 M M 1 1 1 II II 1 1 II 1 1 Mill II II 

GAAGTCATCGCTGGAGACTGGATCTTTCAGCTCAAGACCCAGGCCCATCAGGGCCACTGG 


1845 


Db 


1573 


1632 


Qy 


1846 


GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 

II II 1 1 II 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 II II M II 1 II II II II 1 
GAGGAGGTGGTGACCTTGGATGAGGAGACCCTCAACACACCCTGCTACTGCCAGCTGGAG 


1905 


Db 


1633 


1692 


Qy 


1906 


GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 

1 1 1 II 1 II II II 1 M 1 1 M II M II 1 1 II II M 1 M 

GCTAAGTCCTGCCACATCCTGCTGGACCAGCTGGGTACCTACGTATTCATGGGCGAGTCC 


1965 


Db 


1693 


1752 


Qy 


1966 


CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 

1 1 1 1 1 1 1 1 1 II 1 II 1 1 M II 1 II II II 1 1 II M M 
TACTCTCGCTCTGCAGTCAAGCGGCTCCAGCTGGCCATCTTCGCCCCAGCCCTCTGCACC 


2025 


Db 


1753 


1812 


Qy 


2026 


TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 

M II 1 II 1 M 1 1 II M II II II 1 1 M 1 1 1 M 1 1 1 1 II 1 M 1 M II 1 

TCCCTGGAGTATAGCCTCAGGGTCTACTGTCTGGAGGACACACCTGTAGCACTGAAGGAG 


2085 


Db 


1813 


1872 


Qy 


2086 


GT GGT GCAGCT GGAGAAGCAGCT GGGGGGAC AGCT GAT CCAGGAGCCACGGGT C CT GCAC 

II II II II 1 II 1 1 1 II M 1 1 1 1 III II M II 1 1 II 1 1 
GTCCTGGAGCTGGAGAGGACTCTGGGTGGCTACTTGGTGGAGGAGCCCAAGCCTTTGCTC 


2145 


Db 


1873 


1932 




2146 


TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 

M II II II II 1 M 1 M M II M 1 M 1 1 II 1 II M M 1 II 1 Ml III 

TTTAAGGACAGTTACCACAACCTACGCCTCTCCCTCCATGACATCCCCCATGCCCACTGG 


2205 


Db 


1933 


1992 


Qy 


2206 


AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 

1 II 1 II II II 1 II II 1 M M II II II 1 1 1 II Ml II M M II II II 

AGGAGCAAACTACTGGCCAAGTACCAGGAGATTCCCTTCTACCACGTCTGGAATGGCAGC 


2265 


Db 


1993 


2052 



Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

INI I I I I I I I I I I I I I M I I I I I M I I I I I I I I I II I 

Db 2053 CAGAGAGCCCTGCACTGCACTTTCACCCTGGAGAGGCATAGCCTGGCCTCCACGGAGTTC 2112 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

II II II I I II II I II II I II M II II II II I I Ml I II 

Db 2113 ACCTGTAAGGTCTGCGTGCGGCAGGTCGAAGGGGAAGGCCAGATTTTCCAGCTGCACACA 2172 

Qy 2386 AACATCACC AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 2442 

I I II I I III II II I I I 

Db 2173 ACGTTGGCCGAGACGCCTGCTGGCTCCCTGGATGCTCTCTGCTCTGCCCCGGGCAATGCC 2232 

Qy 2443 GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 2502 

III I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I 

Db 2233 ATCACCACCCAGCTGGGACCCTATGCCTTCAAGATACCCCTGTCCATCCGCCAAAAGATC 2292 

Qy 2503 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2562 

I I II I I I I II I I II I I I I I I I I M I I I I II II I I I I I 

Db 2293 TGCAGCAGCCTGGACGCCCCCAACTCCCGGGGCAACGACTGGAGGCTGTTGGCGCAGAAG 2352 

Qy 2563 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2622 

II I I I I II I I I I I I I I I I I I I I I I I I I I I II M I I I I I I 

Db 2353 CTGTCCATGGACCGGTACCTAAACTACTTCGCCACCAAAGCTAGTCCCACAGGTGTCATC 2412 

Qy 2623 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2682 

I I II I I I I II II I I I II I I I I I I I I I I I I I I I I II 

Db 2413 TTAGACCTCTGGGAAGCTCGGCAACAGGATGACGGGGACCTCAACAGCCTGGCCAGTGCC 2472 

Qy 2683 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

III I I I I II I I II I II I I II I I I I I I I I I 

Db 2473 TTGGAGGAGATGGGCAAGAGTGAGATGCTGGTAGCCATGGCCACAGATGGCGATTGCTGA 2532 
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BI758231 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



1034 bp mRNA linear EST 25-SEP-2001 
603029876F1 NIH_MGC_114 Homo sapiens cDNA clone IMAGE : 5200171 5', 
mRNA sequence. 
BI758231 

BI7 58231.1 GI: 15749809 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1034) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 
Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r @mail . nih . gov 
Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 



FEATURES 

source 



http : //image . llnl . gov 

Plate: LLAMllSOl row: g column: 20 

High quality sequence stop: 7 93. 

Location/Qualifiers 

1. .1034 

/organism="Homo sapiens " 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone="IMAGE: 5200171" 

/lab_host="DH10B" 

/ cl one__l ib= "N I H__MGC__1 14" 

/note="Organ: brain; Vector: pCMV-SP0RT6; Site_l : NotI; 
Site 2: EcoRV (destroyed); RNA source anonymous pool of 6 
male brains^ age range 23-27 yo. Library is oligo-dT 
primed and directionally cloned (EcoRV site is destroyed 
upon cloning) . Average insert size 1.5 kb, insert size 
range 1-3 kb . Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 019. Note: 
this is a NIH_MGC Library." 



ORIGIN 



Query Match 27.9%; 

Best Local Similarity 91.2%; 

Conservative 



Matches 


91^ 


Qy 


150 


Db 


1 


Qy 


210 


Db 


61 


Qy 


270 


Db 


121 


Qy 


330 


Db 


181 


Qy 


390 


Db 


241 


Qy 


450 


Db 


301 


Qy 


510 


Db 


361 


Qy 


570 


Db 


421 



Score 7 68.6; DB 12; 
Pred. No. 4.3e-121; 

0; Mismatches 74; 



Length 1034; 
Indels 15; Gaps 



9; 



209 



GCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACAT 

I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACAT 60 

CGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTT 269 

I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

CGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTT 120 

CAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGG 329 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGG 180 

GAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAA 389 

I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I i I M I I I 

GAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAA 24 0 

GGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCAC 44 9 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I 
GGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCAC 300 

CAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCC 509 

I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAAGAGTCAGAAGGCCTACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAGGAGCC 360 

GCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGG 569 

I I I I I I I I I I I I I M I 1 I I I I I I I I I I I I I I I I I I I I I I I M I I 1 I I I I I I I I I I I I I I I 
GCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGG 420 

CATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGA 62 9 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I M I M I I I I I I I I I I I I M M 

CATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGA 4 80 



Qy 630 CCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGA 689 

I I I I I I I I I I I I I I M I I I I I M M I I I I I I I I I I M I I I I M I I I I I I I I I I M I I I I I 

Db 481 CCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGA 540 

Qy 690 CACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGC 749 

Mill I I I I I I II I I I I I I I I I M I II II I I I I I I I II M I I M I I II I! II II II II I 

Db 541 CACGGACAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGC 600 

Qy 750 TGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGA-CGTGGACCGAGTGGTCCGTCTGCA 808 

M II M II II II I M I I II II I II M II I II I I M I I II I I I I I I I I I I I II I I M I I 

Db 601 TGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACCGTGGACCGAGTGGTCCGTCTGC- 659 

Qy 809 GCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTC 8 68 

M II M II II I II II II II M II I II II II I I I I I II I I I I I II I I 

Db 660 GCGCCAGCTGTGGGCGCGGCTGGCAGAAACCGGAGCCGGAGCTGCACAACCCGGTGCCTC 719 

Qy 869 TCAACGGGGGCGCTTTCTGTGA-GGGGCAGAATGTCCAGAAAZVCAGCCTGCGCCACCCTG 927 

I I I I I I I I I I I I I I I I II I II I I I I I I II I II I I II I I I II I I I I II II 

Db 720 TCAACGGGGGCGCTTTCTGTGAGGGGGCAGTU^TGTCCAGAAAGCAGC TGCGCCACCT 77 6 

Qy 928 TGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGT — GGGCTGGACT 985 

I M I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I II III I 
Db 777 GTGCCAGTGGACGGCAGCTGTAGCACGTGGAGCCAGTGGTCGGCCTGTTGGGCTTGGCTT 836 

Qy 98 6 GCACCCACT--GGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGA 1043 

I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I II I I MM 

Db 837 GCACCCACTTGGGCGGAGCCGGAGTGCTCTGAACCCAGCACCCCGGCACGGCAGGGGGAG 896 

Qy 1044 GTG CCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAG 1100 

II I M II I II I M I I M M II II I I II II I I I I 

Db 897 GTGTGCCCAGGTCACTGGACCTGGCACCCGGGA-TGGTCCAGTGAGCTCTGTGT-CCCAC 954 

Qy 1101 TGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCG 1147 

I II II I II M II II II I Mill II I M II M I I II I II I I 

Db 955 GGGTTCTGGCCCTGAGGACTTGGCCTCCTATGTGGGCCTCATCCCGG 1001 



RESULT 9 

BC033727 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BC033727 1532 bp mRNA linear HTC 19-NOV-2003 

Homo sapiens netrin receptor Unc5hl, mRNA (cDNA clone 
IMAGE: 51667 62) , containing frame-shift errors, 
BC033727 

BC033727.1 GI: 21707230 
HTC. 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1532) 

Strausberg.R.L. , Feingold, E . A. , Grouse, L.H., Derge^ J.G. , 
Klausner.R.D. , Collins, F.S., Wagner, L., Shenmen, CM. , Schuler , G. D . , 
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaef er , C , F . , Bhat,N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko, L. , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton,M. , Soares,M. B. , Bonaldo,M. F. , Casavant, T . L. , 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



FEATURES 

source 



Scheetz^T.E. , Brownstein,M. J. , Usdin.T.B., Toshiyuki, S . , 
Carninci.P., Prange,C., Raha,S.S., Loquellano.N . A. , Peters, G. J-, 
Abramson^R.D. , Mullahy.S.J., Bosak^S.A., McEwan.P.J., 
McKernan^K. J. , Malek^ J.A. , Gunaratne, P.H. , Richards, S. , 
Worley.K.C, Hale,S., Garcia, A. M. , Gay,L.J., Hulyk.S.W., 
Villalon,D.K. , Muzny^D.M., Sodergren, E . J . , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteinan,M. , Madan,A., Rodrigues , S . , 
Sanchez, A., Whiting, M., Madan,A., Young, A. C, Shevchenko, Y. , 
Bouffard,G.G. , Blakesley, R. W. , Touchman, J. W. , Green, E.D., 
Dickson,M.C. , Rodriguez, A. C. , Grimwood, J. , Schmutz,J., Myers, R-M., 
Butterfield, Y. S. , Krzywinski,M. I . , Skalska,U., Sinailus , D . E . , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

22388257 
12477932 

2 (bases 1 to 1532) 
Strausberg,R. 
Direct Submission 

Submitted ( 02- JUL-2 002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact: MGC help desk 

Email: cgapbs-r(§mail . nih. gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC) , 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact : nisc_mgc@nhgri . nih . gov 

Akhter,N. , Ayele,K. , Beckstrom-Sternberg, S.M. , Benjamin, B. , 
Blakesley, R.W. , Bouf f ard, G. G. , Breen,K., Brinkley,C., Brooks, S., 
Dietrich,N.L. , Granite, S., Guan,X., Gupta, J., Haghighi,P., 
Hansen, N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., 
Maduro,Q.L., Masiello,C,, Maskeri,B., Mastrian, S . D. ,McCloskey, J. C . , 
McDowell, J., Pearson, R., Stantripop, S . , Thomas, P. J., Touchman, J. W. , 
Tsurgeon,C., Vogt,J.L., Walker, M. A., Wetherby, K. D . , Wiggins, L., 
Young, A., Zhang, L.-H. and Green, E.D. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/ LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 68 Row: i Column: 2 

This clone was selected for full length sequencing because it 
passed the following selection criteria: GenomeScan gene prediction 
This clone has the following problem: frame shifted. 
Location/Qualifiers 
1. .1532 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="IMAGE: 5166762" 
/tissue type="Brain, adult medulla" 



/ clone_l ib= "NIH_MGC__1 19" 
/lab_host="DH10B" 
/note="Vector: pCMV-SP0RT6" 

ORIGIN 

Query Match 26.8%; Score 736.6; DB 11; Length 1532; 

Best Local Similarity 84.2%; Pred. No. 1.5e-115; 

Matches 917; Conservative 0; Mismatches 4; Indels 168; Gaps 



Qy 

Db 


337 
612 


GGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
GGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTC 


396 
671 


Qy 

Db 


397 
672 


GGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGT 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

GGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGT 


456 
731 


Qy 


457 


CAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCC 

M 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CAGAAGGCCTACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCC 


516 


Db 


732 


791 


Qy 

Db 


517 
792 


AAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCT 
1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M M 1 1 1 1 1 
AAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCT 


576 
851 


Qy 

Db 


577 
852 


CCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAAT 

1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

CCAGCCGAGGTGGAGTGGCTCCGGT^ACGAGGACCTGGTGGACCCGTCCCTGGACCCCAAT 


636 
911 


Qy 

Db 


637 
912 


GTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 
GTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCC 


696 
971 


Qy 

Db 


697 
972 


AACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

AACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTC 


756 
1031 


Qy 


757 


ATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGC 
1 1 1 1 1 M 1 1 1 1 1 


816 


Db 


1032 


1043 


Qy 


PIT 


TrTrir;rzrf::rr::f::rTf::r:rAr;AAArnp;Af;rrGRAGrTGCACCAACCCGGCGCCTCTCAACGGG 


876 


Db 


1044 




1043 




Qy 


877 


GGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTA 


936 


Db 


1044 




1043 




Qy 


937 


GACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGG 

1 1 1 1 M 1 M M 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 

GACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGG 


996 


Db 


1044 


1103 


Qy 

Db 


997 
1104 


CGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACT 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACT 


1056 
1163 



Qy 


1057 


Db 


1164 


Qy 


1117 


Db 


1224 


Qy 


1177 


Db 


1284 


Qy 


1237 


Db 


1344 


Qy 


1297 


Db 


1404 


Qy 


1357 


Db 


1464 


Qy 


1417 


Db 


1524 



GACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAG 1116 

I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I M I I M ! I I M I I I I M I I I I I I I 

GACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACACTGCTTCTGGCCCTGAG 1223 

GACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTT 1176 

I I I I I I I I I I I I I I I I I 1 I M I I I I I I I M I I M I I I I I I I I I I I I 11 I I I M I I I I I I I 
GACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTT 1283 

GTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCG 1236 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I M I I 

GTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCG 1343 

TCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCC 1296 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I 
TCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCC 1403 

CATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTC 1356 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
CATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTC 1463 

TGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGC 1416 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I M I I I I I I 
TGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGC 1523 



I I I I I I I I I 



RESULT 10 

AI951556 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



AI951556 788 bp mRNA linear EST 09-MAR-2000 

wv36f04.xl NCI_CGAP_0vl8 Homo sapiens cDNA clone IMAGE: 2531647 3' 
similar to TR:O08721 O08721 TRANSMEMBRANE RECEPTOR UNC5H1 . mRNA 
sequence . 
AI951556 

AI951556.1 GI: 5743866 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 788) 

NCI-CGAP http : //www, ncbi . nlm. nih . gov/ncicgap . 
National Cancer Institute, Cancer Genome Anatomy Project 
Tumor Gene Index 
Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r Qmail . nih. gov 

Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., Michael 
R. Emmert-Buck, M.D., Ph.D. cDNA Library Preparation: M. Bento 
Scares, Ph.D. cDNA Library Arrayed by: Christa Prange, The 
I.M.A.G.E. Consortium DNA Sequencing by: Washington University 
Genome Sequencing Center 

Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/ image . html 



Craniata ; Vertebrata ; Euteleos tomi ; 
Catarrhini; Hominidae; Homo. 



(CGAP) , 



Insert Length: 1125 Std Error: 0.00 
Seq primer: -40UP from Gibco 
High quality sequence stop: 446. 
FEATURES Location/Qualifiers 
source 1. .788 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="IMAGE: 2531647" 

/tissue_type="f ibrotheoma" 

/lab_host="DH10B (phage-resistant ) " 

/clone_lib="NCI_CGAP_0vl8" 

/note="Organ: ovary; Vector: pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site_l : Not I; Site_2 : Eco RI; 1st 
strand cDNA was primed with a Not I - oligo(dT) primer [5* 
TGTTACCAATCTGAAGTGGGAGCGGCCGCGCGACATTTTTTTTTTTTTTTTTT 3 * ] ; 
double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Not I and cloned into the Not 
I and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization, and was 
constructed by Bento Soares and M. Fatima Bonaldo. " 

ORIGIN 

Query Match 26.7%; Score 735.2; DB 9; Length 788; 

Best Local Similarity 96.5%; Pred. No. 1.9e-115; 

Matches 749; Conservative 0; Mismatches 27; Indels 0; Gaps 0; 

1469 CCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCA 1528 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I 

1 CCGAGGAGTTCGTCTCCCGCCTCTCCGCCCAGAACTACTTCCGCTCCCTGCCCCGAGGCA 60 

1529 CCAGCT^ACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATA 158 8 
M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 CCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATA 12 0 

1589 CAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCT 1648 

MM M M M M M M M M M I M M M M M M M M M M M M M M M M M M 

121 CAGGAATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCT 18 0 
1649 ACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGC 1708 

M M I M I M M I M M M M M M M M M M M M M M M M M M M M I M M M 

181 ACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGC 240 
17 09 TGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGG 17 68 

M M I M I M M M M I M M M M M M M M I M M M M M M M I M M M M M 

241 TGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTTCTGCTCACCCGGCCAGTCATCCTGG 300 
1769 CTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGT 1828 

M M I M M M M n M M M M M M M M M M M M M M M M M M M M M M I 

301 CTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGT 360 
1829 CGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCT 1888 

M M M M M M M M M M I M M I M M M M M M M I M M M M M M M M M I 

361 CGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCT 420 
1889 ACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTG 1948 

M M M M M M M M M M M M M M M M M M M M M I I I M I I I I I I I M II 



Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



Db 



421 ACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGAGCCGCTATG 480 



Qy 1949 CCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTG 2008 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I 
Db 481 CCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTG 540 

Qy 2009 CGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCC 2068 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 CGCCGGTGGCCTGCACCTCCCTCGAGTACAACATACTGGTCTACTGCCTGCATGACACTC 600 

Qy 2069 ACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGG 2128 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 601 ACGATGCACTCAACGTAGTGGTGCAGCTGGAGAAGCAGCTGCAGGGACAGCTGATCCAGG 660 

Qy 2129 AGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATG 2188 

I I I I I I III I II I I I I M II M II I I I M II II II II II I I M I II I I I I I I I I I I I 

Db 661 AGCCACTGGTACTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATNCATCCACGATG 720 

Qy 2189 TGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTT 2244 

I I M II I II II I I I I I II I I I I I I I I M I I I I I M II I I I M II I 

Db 721 TGCCCAGCTNCCNTGTGGAGAGTAAGCTTCTTGTCAGCTACCCAGAGATCCNCTAT 776 



RESULT 11 

BX348193/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BX348193 843 bp rtiRNA linear EST 05-MAY-2003 

BX348193 Homo sapiens NEUROBLASTOMA COT 10-NORMALIZED Homo sapiens 
cDNA clone CS0DB008YE02 5-PRIME, mRNA sequence. 

BX348193 

BX34 8193. 1 GI: 30367258 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 843) 

Li,W.B., Gruber.C, Jessee.J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished (2001) 
Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref @genoscope . ens . f r^ Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 3239. r For 
more information about this cluster, see 
http : //www. genoscope. ens . f r/ 

cgi-bin/cluster .cgi?seq=CSOBAF004ZD01_AF00293_l&cluster=3239. r . 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAF004ZD01_AF00293_1 . 
Location/Qualifiers 
1. .843 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DB008YE02" 



/tissue_type="NEUROBLASTOMA COT 10 -NORMALIZED" 
/clone_lib="Hoino sapiens NEUROBLASTOMA COT 10 -NORMAL I ZED" 
/note="lst strand cDNA was primed with a Notl-oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 



ORIGIN 



Query Match 26.3%; Score 722.8; DB 13; Length 843; 

Best Local Similarity 95.3%; Pred. No. 2.6e-113; 

Matches 776; Conservative 0; Mismatches 34; Indels 4; Gaps 3; 

Qy 1653 CACGCTGCACAAGCCGGAAGACGTGAGG — TTGCCCCTAGCTGGCTGTCAGACCCTGCTG 1710 

I I I I I I I M I I I I I I I III I III I I II II I I I I I I I I I I I I I 

Db 824 CACGCGGCCAAAAGCGGAAGAGTGTAGGGTTGCCCCAAACCGGGCTGTCAGACCCTGTTG 765 

Qy 1711 AGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCT 1770 

II I I I II I I I I I I I M M I I I M M I I I M I I I I I I M I I M I I I M I I I I I I I 

Db 7 64 AGTCCCATGGTTAGCTGTGAA-CCCCTGGCGTCCTGCTCACCGGCCCAGTCATCCTGGGT 706 

Qy 1771 ATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTC-AAAAAGCAGTC 1829 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II II I I I I I I I II II I I I 
Db 705 ATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGAGGCCTGCGCCTCAAAAAAACAGTC 646 

Qy 18 30 GTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTA 18 8 9 

M I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 645 GTGCGAGGACAGCTGGGAGTATGTCCTGCACCTGGGCGAGNAGGCGCCCTCCCACCTCTA 586 

Qy 1890 CTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGC 1949 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I 

Db 585 CTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGC 526 

Qy 1950 CCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGC 2009 

II II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I II I I I 

Db 525 CCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGC 466 

Qy 2010 GCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCA 2069 

II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 65 GCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCA 406 

Qy 2070 CGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGA 2129 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I II I I I I I I I I II I I I II 

Db 405 CGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGA 346 

Qy 2130 GCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGT 2189 

II II I I M II II I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 345 GCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGT 286 

Qy 2190 GCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCA 2249 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I M I I I I I I I I I I II II I I I I I I 
Db 285 GCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCA 226 

Qy 2250 CATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCC 2309 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I II I I 
Db 225 CATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCC 166 



Qy 



2310 CAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAG 2369 



I I I I M I I I I I I I I M M I I I I I M I I I I I I I I I I I M I I I I I I I I I I M M 1 I I I I I I I 

Db 165 CAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAG 106 

Qy 2370 CTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGA 2429 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I M I M I I I I I I I I I I I I I I I I I I I I 

Db 105 CTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGA 46 

Qy 2430 GAGTGTyVGCGGGGGTCCCAGCCCTGGTGGGCCCC 2463 

I I I I M I I I I I I I I I I II M II I M III 

Db 45 GAGTGAAGCGGGGGTCCCAGNCCCTGGTGGGCCC 12 



RESULT 12 

BI818609 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



BI818609 818 bp mRNA linear EST 04-OCT-2001 

603033362F1 NIH__MGC_115 Homo sapiens cDNA clone IMAGE: 5174559 5', 
mRNA sequence. 
BI818609 

BI818609.1 GI: 15929902 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 818) 
NIH-MGC http://mgc.nci.nih, gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg^ Ph.D. 

Email : cgapbs-r@mail . nih. gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E, Consortium/LLNL at: 
http : //image . llnl . gov 

Plate: LLAM11434 row: 1 column: 16 
High quality sequence stop: 744. 

Location/Qualifiers 

1. .818 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone="IMAGE: 5174559" 

/lab_host="DH10B" 

/ cl one_l ib= " NI H_MGC_1 15" 

/note="Organ : pooled brain, lung, testis; Vector: 
pCMV-SP0RT6; Site_l: NotI; Site_2 : EcoRV (destroyed); RNA 
source anonymous pool of 6 male brains, age range 23-27; 1 
male lung, age 27; and 1 male testis, age 69. Library is 
oligo-dT primed and directionally cloned (EcoRV site is 
destroyed upon cloning). Average insert size 1.8 kb, 
insert size range 1-3 kb. Library is normalized and 
enriched for full-length clones and was constructed by C. 
Gruber (Invitrogen) . Research Genetics tracking code 
021. Note: this is a NIH MGC Library." 



ORIGIN 



Query Match 26.0%; Score 716.4; DB 12; Length 818; 

Best Local Similarity 91.1%; Pred. No. 3.2e-112; 

Matches 748; Conservative 0; Mismatches 16; Indels 2; Gaps 2; 

Qy 43 GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 102 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 60 

Qy 103 CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 162 

I I I I I I I I I I M I I I I I M I I M I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I 

Db 61 CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 120 

Qy 163 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 222 

I I I I I I M I I M I I I M I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I 

Db 121 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 180 

Qy 223 CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 282 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 240 

Qy 283 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 342 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 241 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 300 

Qy 343 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 360 

Qy 403 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I M I I I I I I I 

Db 3 61 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 420 

Qy 463 GCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 522 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 421 GCCTACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 480 

Qy 523 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 582 

I I I I M M I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 54 0 

Qy 583 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 642 

I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 541 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 600 

Qy 643 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTAC 702 

I I I M I I I I I I I I M M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTAC 660 

Qy 703 ACCTGCGTGGCCAAGAACATCGTGGCACGTC-GCCGCAGCGCCTCCGCTGCTGTCATCGT 761 

I I I I I I I I I I I I I M I I M I I I I I I I I I M I I I I I I M I I I I I I I M I I I I I I I M I I 
Db 661 ACCTGCGTGGCCAAGAACATCGTGGCACGTCAGCCGCAGGGCCTCCGCTGCTGTCATCGT 720 

Qy 762 CTACGTGAACGGTGGGTGGTCGACG-TGGACCGAGTGGTCCGTCTG 806 

I I I I I I I I I M I I I I I I I I I I I I M I I M I I I I 

Db 721 CTACGTGGACGGCAGCTGGAGCCCGTTGGAGCCAGTGGTCGGGCTG 766 
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FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BX364574 909 bp mRNA linear EST 05-MAY-2003 

BX364574 Homo sapiens NEUROBLASTOMA COT 25-NORMALIZED Homo sapiens 
cDNA clone CS0DC023YB10 5-PRIME, mRNA sequence. 
BX364574 

BX364574.1 GI: 3 0368 812 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 909) 

Li^W.B., Gruber,C.^ Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished (2001) 
Contact: Genoscope 

Genoscope - Centre National de Sequencage 

BP 191 91006 EVRY cedex - France 

Email: seqref (§genoscope . ens . f r, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 

Invitrogen. This sequence belongs to sequence cluster 3529. f For 
more information about this cluster^ see 
http: //www. genoscope. ens . f r/ 

cgi-bin/cluster . cgi?seq=CSlAC006ZF06QPl&cluster=3529. f . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS1AC006ZF06QP1 . 

Location/Qualifiers 

1. .909 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DC023YB10" 

/tissue_type="NEUROBLASTOMA COT 25-NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 25-NORMALIZED" 
/note="lst strand cDNA was primed with a Notl-oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 



ORIGIN 



Query Match 25. 1%; 

Best Local Similarity 99.0%; 
Matches 703; Conservative 



Score 692; DB 13; Length 909; 
Pred. No. 5.1e-108; 
3; Mismatches 3; Indels 



1; Gaps 



1; 



Qy 



Db 



1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

I I M : I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
201 CCGCRGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 260 



Qy 

Db 



61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
261 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 320 



Qy 121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 



Db 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I : I I I I I I M I I I M M I I I I I I 

321 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCYAACCCGGACCTGCTTCCCCAC 380 



Qy 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I 

Db 381 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 44 0 

Qy 241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 441 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 500 

Qy 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 501 GACCACGTGATCGAGCGCA-SACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 559 

Qy 361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 560 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 619 

Qy 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 620 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 679 

Qy 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 540 

I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 
Db 680 TATTTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 739 

Qy 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 740 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 7 99 

Qy 601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 800 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 859 

Qy 661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGT 710 

I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

Db 860 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGT 909 
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GSS 12-DEC-2003 
partial sequence. 



AY411748 2532 bp DNA linear 

Pan troglodytes HCM4327 gene, VIRTUAL TRANSCRIPT, 
genomic survey sequence. 
AY411748 

AY411748.1 GI:39767716 
GSS. 

Pan troglodytes (chimpanzee) 
Pan troglodytes 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 

1 (bases 1 to 2532) 

Clark,A.G., Glanowski, S • , Nielson,R., Thomas, P., Kejariwal, A. , 
Todd, M. A., Tanenbaum,D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Pan. 



TITLE Inferring nonneutral evolution from human- chimp -mouse orthologous 

gene trios 

JOURNAL Science 302 (5652), 1960-1963 (2003) 
PUBMED 14671302 
REFERENCE 2 (bases 1 to 2532) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas,?., Ke jariwal, A. / 
Todd, M. A., Tanenbaum, D.M. , Civello, D.R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J . J. , 
Adams, M.D. and Cargill^M. 
TITLE Direct Submission 

JOURNAL Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 
COMMENT This sequence was made by sequencing genomic exons and ordering 

them based on alignment. 
FEATURES Location/Qualifiers 
source 1. .2532 

/organism="Pan troglodytes" 
/mo l_type=" genomic DNA" 
/ db_xr e f = " t axon : 9 5 9 8 " 
gene <1. •>2532 

/locus_tag="HCM4327" 

ORIGIN 

Query Match 24.6%; Score 678.2; DB 29; Length 2532; 

Best Local Similarity 54.4%; Pred. No. 1.9e-105; 

Matches 1370; Conservative 0; Mismatches 1021; Indels 129; Gaps 9; 

Qy 352 GAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAG7SJVGGTGTTCGGGCTGGAGGAATAC 411 

I I I I I I II I II I I I I I I I I II II I I I I I I II I I I I I I I I II I I I I 

Db 13 GAGGTGCAGATCGAGGTGTCGCGGCAGCAGGTGGAGGAGCTCTTTGGGCTGGAGGATTAC 72 

Qy 412 TGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATC 471 

II I II II I II I M I I II I I I I I I I I I I I I I I II I II I II I I II I I I I I I II 

Db 73 TGGTGCCAGTGCGTGGCCTGGAGCTCTGCGGGCACCACCAAGAGTCGCCGAGCCTACGTC 132 

Qy 472 CGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTG 531 

I I I I I III 

Db 133 CGCATCGCCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 192 

Qy 532 GAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAG 591 

I I I I I 

Db 193 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTGGAA 252 

Qy 592 TGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGG 651 

I I I I I I I I I I II II I I I I I I I I I I M II I M I I I I I I 

Db 253 TGGCTCAAGAATGAGGATGTCATCGACCCCACCCAGGACACCAACTTCCTGCTCACCATC 312 

Qy 652 GAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTG 711 

I I I I I I I I I I I I I I I I I I II II I I I I I I II M I I I I I I I M II I I I I 
Db 313 GACCACAACCTCATCATCCGCCAGGCCCGCCTGTCGGACACTGCCAACTATACCTGCGTG 372 

Qy 712 GCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAAC 771 

I I I I I I I I I I I I I II I I I I I II III II I II I I I I I I I I I I I II I II I I 

Db 373 GCCAAGAACATCGTGGCCAAACGCCGGAGCACCACTGCCACCGTCATCGTCTACGTGAAT 432 



Qy 



772 GGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGG 831 

I I I I I I I II I I I I I II I I I M I MM I I I M II II I MM 



Db 433 GGCGGCTGGTCCAGCTGGGCAGAGTGGTCGCCCTGCTCCAACCGCTGTGGNNGANNCTGG 492 

Qy 832 CAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAG 8 91 

I I I I I II I I I I I I I I I I I I I I I I I I II II I I II I I M II I I I I III 

Db 493 CAGAAGCGCACCCGGACCTGCACCAACCCCGCCCCACTCAACGGAGGGNNNNTCTGCGAG 552 

Qy 892 GGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGC 951 

II I I I II II II II 

Db 553 GGCCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCGATGGGGCGTGGACG 612 

Qy 952 CCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGC 1011 

I M M I I I I I I I I M I I I I I I I I II I I I I M I I I I I I I I I I I I M I 

Db 613 GAGTGGAGCAAGTGGTCAGCCTGCAGCACTGAGTGTGCCCACTGGCGTAGCCGCGAGTGC 672 

Qy 1012 TCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGC 1071 

I II I I I I II I I I I I I I I I I I II I I I I I I I I I I 

Db 673 ATGGCGCCCCCACCCCAGAACGGAGGCCGTGACTGCAGCGGGACGCTGCTCGACTCTAAG 732 

Qy 1072 AACTGTACCAGTGACCTCTGTGTACACAGTGCTTCT 1107 

I I I I I I I II I I I I I II I I 

Db 733 AACTGCACAGATGGGCTGTGCATGCAAAATAAGAAAACTCTAAGCGACCCCAACAGCCAC 792 

Qy 1108 GGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC ATCGCCGTGGCCGTC 1155 

I I I I I I I I I I I II I I II I II I II I I I I I I I I 

Db 793 CTGCTGGAGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCGTCTTCNTG 852 

Qy 1156 TGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTG 1215 

MM I I I I I II I I M I I I I I I I I I I 

Db 853 GTCNTGGCAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTC 912 

Qy 1216 GACTCAGATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATC 1272 

II I I I I I I I M I I I I M I I I I I II I I I II I I I I I I I I I 

Db 913 GACACAGACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTT 972 

Qy 1273 AAGCCCAGCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGC 1326 

I I I I I I I I I I I I I I I I I I II M I I I I I I I I 

Db 973 AAGACGGCAAGGCCCAGCAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACA 1032 

Qy 1327 ACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGG 1374 

I I I I I I I I I I I I I M I I I I I I I I I I I I 

Db 1033 GCCAGCGCCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATC 1092 

Qy 1375 CCCAGCCCCAAGTTCCAGCTCACCA — 1399 

M I I I I I I I I I I I I II 11 
Db 1093 CCCATGACCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGC 1152 

Qy 1400 ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCC ACACA 1440 

I M M I I I I I I I I I I M I I I 

Db 1153 TCCAGCACTACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGAGTCTTG 1212 

Qy 1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

III I I I II I I I MINI I M I I I I I 

Db 1213 CCGCCTGGCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGC 1272 

Qy 1501 AACTACTTCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I I I I M I I I I M II I I I I I III I 

Db 1273 GCCAGCCTCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGNNAGCAGCGTCAGC 1332 



Qy 1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I I I I I M I I I I I I I I I 

Db 1333 GGCACCTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGTCAGCTTGCTG 1392 

Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 1393 GTGCCCAATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAG 1452 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

MM II MM II II I I I M I III I I I I 

Db 1453 GCAGAAAGTACCCTCCCGCTTTCAGTJ^GGGACCCAGACAGTATTGAGCCCCTCAGTGACC 1512 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

M I M M II I II I II II II I II II I II M III I I I I I II M I I I 

Db 1513 TGTGGACCCACAGGCCTCCTGCTGTGCCACCCCGTCATCCTCACCATGCCCCACTGTGCC 1572 

Qy 1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTC7UWUVGCAGTCGTGCGAGGGCAGCTGG 1845 

II II I I II I M I I I M M I 1 II I I I I II II I I 11 

Db 1573 G75JVGTCAGTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGG 1632 

Qy 1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I I I I I I II II I I I I I II II I I I I I I I I M I I M I M I M II 

Db 1633 GAGGAGGTGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAG 1692 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

I I I I I II I I I I I I I I I M I II I M I II I I I II I M II 

Db 1693 CCCAGGGCCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCC 1752 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

I III III I I I I I I I I I I I I I I II I II 

Db 1753 TATTCCCNNNNNNCAGTCNNNNNGCTCCNGCTGGCCGTCTTNGCCCCTGCCCTCTGCACC 1812 

Qy 2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

I I M I M I I II I I M I II II M M I III 

Db 1813 TCCCTGGANNNNNGCCTCCGGGTCTACTGCCTGGNNNNNNNNNNNNNNNNNNNNNNNGAG 1872 

Qy 2 086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

I I I I I I I I I I I I I I M I II I II I III I I I II I I II 
Db 1873 GTGCTGGAGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATG 1932 

Qy 2146 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

I I I I I II I I I I I I M I M I M I I II I II I I I I II II I I I I I I Ml Ml 

Db 1933 TTCAAGGACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGG 1992 

Qy 2206 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

I III Mill M I II lllllllllllllllll I llllll MM Mill 

Db 1993 AGGAGCAAGCTGCTGGCCAAATACCAGGAGATCCCCTTCTGTCACATTTGGAGTGGCAGC 2052 

Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

III I I I II II I II I M II I II I II M I MM III II I I 

Db 2053 CAGAAGGCCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTC 2112 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

M I II I II I II I M II II II I I I M II II II II III I I I 

Db 2113 ACCTGCAAGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACC 2172 



Qy 2386 AACATCACCAAG GACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 2442 

I I I I I III II II I II 

Db 2173 ACTCTGGCAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACT 2232 

Qy 2443 GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 2502 

II I I M I I I I I I M II II N M I I I II I I I I II M II I M I I I I 

Db 2233 GTCACCACCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATA 2292 

Qy 2503 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2562 

I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I II 

Db 22 93 TGCAACAGCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAG 2352 

Qy 2563 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2622 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 2353 CTCTCTATGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATC 2412 

Qy 2623 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2682 

II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 

Db 2413 CTGGACCTCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCC 2472 

Qy 2683 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

III I I I I I I I I II I I I I I I M I I I I II 

Db 2473 TTGGAGGAGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2532 



RESULT 15 

BF311804 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BF311804 934 bp mRNA linear EST 21-NOV-2000 

601897316F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE: 4126706 5', 

mRNA sequence. 
BF311804 

BF311804.1 GI: 11259566 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 934) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Healthy Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r(5mail . nih . gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: image.llnl.gov 
Plate: LLCM1016 row: p column: 03 
High quality sequence stop: 7 07. 
Location/Qualifiers 
1. .934 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="IMAGE: 4126706" 



/tissue_type="neuroblastoma " 
/lab__host="DH10B (phage-resistant ) " 
/clone_lib="NIH_MGC_19" 

/note="Organ: brain; Vector: pOTB7 ; Site_l : Xhol; Site_2 : 
EcoRI; cDNA made by oligo-dT priming. Directionally 
cloned into EcoRI/XhoI sites using the following 5' 
adaptor: GGCACGAG(G). Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 



ORIGIN 



Query Match 24.4%; 

Best Local Similarity 93.3%; 
Matches 747; Conservative 



Score 672.6; DB 10; 
Pred. No. l.le-104; 
0; Mismatches 49; 



Length 934; 
Indels 5; 



Gaps 



4; 



Qy 1116 GGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCT 1175 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 GGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCT 61 



Qy 

Db 

Qy 

Db 



117 6 TGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTC 12 35 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

62 TGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTC 121 

1236 GTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGC7yU\GCAGACAACCC 12 95 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

122 GTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCC 181 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



12 96 CCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCT 1355 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M 
CCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCT 241 



182 



1356 CTGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAG 1415 

I I I M I I I I I M I I I I I I I I I M I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I 

CTGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAG 



242 



301 



1416 CCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGA 14 75 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I 
302 CCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGA 361 

1476 GTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAA 1535 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I M I I M I I I I I I I 
362 GTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAA 421 

1536 CATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTAT 1595 
I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
422 CATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAAT 481 

1596 CAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCAC 1655 

III I M II I II I I I I II II I I I I I I II II I I I I M I I I II I II I II I I I M II M II I I 

CAG-CTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCAC 540 



482 



1656 GCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCC 1715 
II I II I I II II I I II II II I II M II II I II I I I I I I I I I I I I I I I I M I I I I I I I II I 
541 GCTGCACAAGCCGGAAGACGTGAGGTTG-CCCTAGCTGGCTGTCAGACCCTGCTGAGTCC 599 



Qy 1716 CATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCC — TGGCTATG 1773 

I I I M I I I I I I I I I I I M I I I I I I I M M I I I I I I MM M M M M I I Ml 

Db 600 CATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCAACCGGGCAGTCATCCCTGGCTAATG 659 

Qy 1774 GACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAA7\AAGCAGTCGTGC 1833 

MM Ml I M M M M M M M M I M M M I MM 

Db 660 GACCCACTGTGGGGAAGCCAGCCTGACAGTTGGAGCCTGGGCTCAAAAAGGAGTCGGTGC 719 

Qy 1834 GAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTAC 1893 

M M M M M M M M M M M I M Ml M I M I I M II II I I II II 

Db 720 GAGGGCAGCTGGGA-GATGTGCTGGACTGGGGGAAGAAGGGGGCCTCCACCCTCTATTAT 778 

Qy 1894 TGCCAGCTGGAGGCCAGTGCC 1914 

II III II II II II II 

Db 779 TGGCAGTGGGAGGCAAGGCCC 799 

Search completed: July 12, 2004, 19:58:13 
Job time : 6792 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search^ using sw model 



Run on : 



July 12, 2004, 14:36:51 



Search time 10439 Seconds 
(without alignments) 
11426.385 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-624-932-1 

2752 

1 ccgcggggccccgcgcccgg , 



, tgagtgctgaggccggccag 2752 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 3470272 seqs, 21671516995 residues 

Total number of hits satisfying chosen parameters: 6940544 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : GenErabl : * 



1- 




gb ba : * 


2 




gb htg:* 


3 




gb in:* 


4: 




gb_om : * 


5 




gb ov : * 


6 




gb pat:* 


7 




gb ph : * 


8 




gb_pl : * 


9 




gb pr:* 


10: 


gb ro:* 


11: 


gb sts:* 


12: 


gb s y : * 


13 


gb un : * 


14 


gb vi : * 


15 


em ba : * 


16 


em_f un : * 


17 


em hum: * 


1( 


3 


em in:* 


19 


em mu : * 


20 


: em om:* 


21 


em or:* 


22 


: em ov:* 


23 


: em pat : * 


24 


: em ph : * 


25 


: em_pl : * 


26 


: em ro : * 


27 


: em sts:* 



o o 
o 


em 


un t * 


O Q 

/L y 


erri 


vi : 


Q n 
oU 


em 


iiuy num. ^ 


O 1 


: em 


htg inv:* 


o o 


em 


Vn y-y -w ■ ?Ar 

nug otiner : 


33 


em 


htg mus : * 


o4 


1 em 


Hug p±n . ^ 


io 


em 


htg rod : * 




em 


11 U y iUdlLL « 


37 


: em_ 


htg_vrt : * 


38 


: em_ 


sy : * 


39 


; em_ 


htgo_hum: * 


40 


em 


htgo mus : * 


41 


em_ 


htgo other:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



Fo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


2752 


100 


.0 


2752 


6 


AX449372 


AX44yD/2 Sequence 


2 


2687 . 4 


97 
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2 697 


6 
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AX4c)1ddz Sequence 
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2676 . 4 
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6 
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Au 4 o / odz mu s mu s cu 


6 


2259 


82 


.1 


3014 


6 


BDUo /024 


dJJUo/oz^ JNetrin re 


7 


2252.2 


81 


.8 


2697 


6 


AX268596 


AX268596 Sequence 


8 


2252.2 


81 


.8 


2697 


10 
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9 
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72 
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41 
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.7 
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6 
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Sequence 


42 


480. 


4 


17 


.5 


2406 


6 
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Sequence 


43 


480. 


4 


17 


.5 


2406 


9 
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Homo sapi 


44 


478. 
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.4 
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Sequence 
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T^^IGNMENTS 



RESULT 1 
AX449572 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AX449572 



2752 bp 



DNA 



linear 



PAT 03-JUL-2002 



Sequence 1 from Patent WO0210216. 
AX449572 

AX449572.1 GI: 21698195 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Padigaru^M., Mezes^P., Mishra,V.^ Burgess^ Casman^S., 
Grosse,W.M. , Alsobrook^ J. P. , Lepley^D.M. , Gerlach, V.L. , 

Macdougally, J . R. and Smithson^G. 
Proteins and nucleic acids encoding same 
Patent: WO 0210216-A 1 07-FEB-2002; 
Curagen Corporation (US) 

Location/Qualifiers 

1. .2752 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db xref="taxon:9606" 



Query Match 100.0%; Score 2752; DB 6; Length 2752; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2752; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 

Db 



1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 6 0 

I I I I I I I I I I I M I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M 

1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 6 0 



Qy 

Db 



61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I 
61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 



Qy 

Db 



121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

M I I I I I I I I I I I M I I I I I I I M I I M I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I 

121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 



Qy 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I M I I I I I I I I I I I I I I 
Db 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

Qy 241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

Qy 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

Qy 361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I 
Db 361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

Qy 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

Qy 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 54 0 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 540 

Qy 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

Qy 601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I 
Db 601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

Qy 661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I 
Db 661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 

Qy 721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I 
Db 721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 

Qy 781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 840 

I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 84 0 

Qy 841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I M I 
Db 841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 900 

Qy 901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 

I I I M I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I 

Db 901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 

Qy 961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 1020 

I I I I I I I I M I I 1 I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 1020 



Qy 1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 1080 

I I I I I M I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 108 0 

Qy 1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 1140 

I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 1140 

Qy 1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 

Qy 1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 1260 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 1260 

Qy 12 61 CCCGTCAGCATCAAGCCCAGCAT^GCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 132 0 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 

Qy 1321 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 1380 

Qy 1381 CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 14 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 1381 CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 1440 

Qy 1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
Db 1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

Qy 1501 AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 

Qy 1561 CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 1620 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 1620 

Qy 1621 ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 168 0 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 1680 

Qy 1681 TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

Qy 1741 GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 1800 

I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 1741 GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 1800 

Qy 1801 AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 1801 AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

Qy 1861 CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 1920 



I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M M I I I I I I I I I M I I I I 

Db 1861 CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 1920 

Qy 1921 GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 

I I I I M I I I I I I I I I I I I M I I I I I I M I I I I I M I I M I I I M I I I I I I I I I I I I I I I I 

Db 1921 GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 198 0 

Qy 1981 GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I I I 
Db 1981 GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 204 0 

Qy 2041 ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 041 ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

Qy 2101 AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2101 AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

Qy 2161 CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2161 CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 222 0 

Qy 2221 GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I M I I 
Db 2221 GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

Qy 2281 TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 

I I I I I I I I I M I I I I I M I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 2281 TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 

Qy 2341 GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCTiAGGAC 2400 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2341 GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 24 00 

Qy 2401 ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 24 60 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I 

Db 2401 ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 2460 

Qy 2461 CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

I I I I I I M I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I 

Db 2461 CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

Qy 2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 258 0 

I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 258 0 

Qy 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I 
Db 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

Qy 2641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 2641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 



Qy 



2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 



Db 



2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 
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AX451652 
Sequence 1 
AX451652 
AX451652.1 



,.,..^- 2697 : bp- 

from Patent WO0233080. 

GI:21698587 



.DNA 



linear PAT 03-JUL-2002 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Koehler.R.H. 

Regulation of human netrin binding membrane receptor unc5h-l 
Patent: WO 0233080-A 1 25-APR-2002; 
Bayer Aktiengesellschaf t (DE) 

Location/Qualifiers 

1. .2697 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db xref="taxon: 9606" 



Query Match 97.7%; 
Best Local Similarity 99.8%; 
Matches 2691; Conservative 



Score 2687.4; 
Pred. No. 0; 
0; Mismatches 



DB 6; Length 2697; 

6; Indels 0; Gaps 



Qy 


46 


Db 


1 


Qy 


106 


Db 


61 


Qy 


166 


Db 


121 


Qy 


226 


Db 


181 


Qy 


286 


Db 


241 


Qy 


346 


Db 


301 


Qy 


406 


Db 


361 



ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 

I I I I I I I I M I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 

CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I 
CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 



0; 



105 



60 



165 



120 



GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

M I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I M I I M I I I M I I I I I 

GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 



GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 

I M I I I I I I I I I I I I I I I I I I I I I I 1 I I I I M I I I I I M I I I I I I I I I I I I M I I I I M I 

GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 
TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 

I I M I I I I I I M I I I M I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I 

TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 

ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 
I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I M I I I I I I I I I I I I I I I I 
ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 

GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I 

GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 



180 



285 



240 



345 



300 



405 



360 



465 



420 



Qy 466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

I I I I I I I I I I I I I I I }'l I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I 
Db 421 TACATCCGCATAGCCIAI^TTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCC;^ 4 8 0 

Qy 52 6 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I M M I I I I I I I I M I I I I I I I I I I I 

Db 481 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 54 0 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 600 

Qy 646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I 
Db 601 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 660 

Qy 7 06 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 661 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 72 0 

Qy 7 66 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 780 

Qy 826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 885 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 7 81 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 84 0 

Qy 88 6 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i; I I I I I I I I I 

Db 841 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTGPACGGCAGC 900 

Qy 946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

I I I I I I I I I I M I I I I I I M I I I I I M I I I I I I I I I I I I M M I I I I I I I I I I I I I I I M 

Db 901 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 960 

Qy 1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M 
Db 961 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1020 

Qy 1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I U' I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAcxfeCTTCTGGCCCTGAGGACGTGGCC 1080 

Qy 1126 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

I I I I I M I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1140 

Qy 1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
Db 1141 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1200 

Qy 124 6 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 

I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1260 



Qy 1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I M I I I I M I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M M I I I 

Db 12 61 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1320 

Qy 1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 1321 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1380 

Qy 1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I I I M I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I 

Db 1381 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1440 

Qy 1486 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 1441 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1500 

Qy 1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I M I I I M I I I I I I I I 

Db 1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTC 1560 

Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGG7\AGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I M I I I I I M I I I 
Db 1561 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1620 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1680 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 1681 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1740 

Qy 178 6 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 18 45 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I M M I I I I I I I I I 
Db 1741 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1800 

Qy 1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I I I I I I I I I I I I M I I I I I I I M M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 1801 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1860 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1920 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 1980 

Qy 2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 1981 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2040 

Qy 2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 2041 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2100 

Qy 2146 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 



Db 2101 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2160 

Qy 22 06 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 2161 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 222 0 

Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2221 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2280 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2281 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 234 0 

Qy 2386 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2341 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGT^GCGGGGGTC 2400 

Qy 2446 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2401 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2460 

Qy 2506 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 2565 

I I I I I I I I I M I I I I 1 I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2461 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 2520 

Qy 2566 CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2625 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I 

Db 2521 CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2580 

Qy 2626 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2685 

I I I I I I I I M I I I I I I M I I M I I I I I I I I I I I I M I I M I M I I I I I I I I I I I I I I I I I 

Db 2581 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 264 0 

Qy 2 686 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 641 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2697 
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FEATURES Location/Qualifiers 
source 1. .2881 

/organism="Homo sapiens" 
/inol_type="unas signed DNA" 
/db_xref="taxon: 9606" 

ORIGIN 

Query Match 97.3%; Score 2676.4; DB 6; Length 2881; 

Best Local Similarity 98.9%; Pred. No, 0; 

Matches 2728; Conservative 0; Mismatches 21; Indels 9; Gaps 3; 
Qy 1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I 

Db 42 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 101 

Qy 61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 102 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 161 

Qy 121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 18 0 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I M I I I I I I I I I I M I I I I M I I I 

Db 162 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 221 

Qy 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 222 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 281 

Qy 241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I M I I M I I I I I I I I I I I I I I I M I M 

Db 282 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 341 

Qy 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I II I I I I I I I I II I I 

Db 342 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGTGAGCCGACCATGGAGGTCCGC 401 

Qy 361 ATTJ^ATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

II I I II I II II I I II I M II I II I II I I II I I I I I II I I I I II II I M II I I I I I II I I I 

Db 402 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 4 61 

Qy 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

I II II I I I I I I I M I II M M I I I I I I I M II II I I I I I I I I I II I I I I I I I II I I I I II 
Db 462 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 521 

Qy 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 54 0 

I I I I II II I I I I I II I II I I I I I I I I I I I I M M I II II I II II I I I I I I I I I I I I I I I I 
Db 522 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 581 

Qy 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

I I M I I I I I M I I I I I II I I I I I I II II I I I I I I II I II I I I I I II I M I I II I I I I I I I 
Db 582 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 641 

Qy 601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

I II I I I II II I I M II II M I I II I II II M I II II I II II II I II I M I I I I I I I I I I I 

Db 642 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 701 



Qy 



661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I 
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Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



7 02 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCT^AGAAC 7 61 



721 



762 



781 



822 



841 



882 



901 



942 



958 



ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 78 0 

M I I I I M I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I M I I I I I I M I I I I I I I I I 

ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 821 



TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 

I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I M M I I I I I I I I I I I I I 

TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 

AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 

I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I M I I 
AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 



840 



881 



900 



941 



957 



GTCCAGAA AACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGG 

I I I I I I I I I I I I I II II I I I I I I I I I I II I II II I I I I I I 

GTCCATGACCGCACCGTCTCCTCTCTGCTTGTCTCTGTGGACGGCAGCTGGAGCCCGTGG 1001 



AGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGAC 1017 

II I I II II II M I II II II II I I II M II I II II I II II II I I II II II I II II I I II M 
1002 AGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGAC 1061 

1018 CCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGT 1077 

I I I I I I I I I II I I I I M I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1062 CCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGT 1121 

1078 ACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGC 1137 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1122 ACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGC 1181 

1138 CTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGC 1197 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1182 CTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGC 1241 

1198 CGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTC 1257 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1242 CGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTC 1301 

1258 CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCG 1317 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1302 CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCG 1361 

1318 GACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCC 1377 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I 
1362 GACCTCAG CACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCC 1418 

1378 AGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCAC 1437 

I I I I I I I I I I I I I II I I I I I I I M I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1419 AGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCAC 1478 

1438 ACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACC 1497 

I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1479 ACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACC 1538 

1498 CAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAAC 1557 

I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1539 CAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAAC 1598 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1558 TTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGAT 1617 

I I I I I I I I I I M I I I I I I M I I I M I I I I I I I I I I I M I I I M I I I I I I M I I I I I I I I I 

1599 TTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGAT 1658 

1618 GCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1677 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M 
1659 GCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1718 

167 8 AGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCT 1737 

I I I I I M I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I M 

1719 AGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCT 177 8 

1738 GGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCT 17 97 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I 
1779 GGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCT 1838 

1798 GACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGG AGGATGTG 1854 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I M I I I I I I I I I M I 

1839 GACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGCAGGATGTG 18 98 

1855 CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCC 1914 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
1899 CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCC 1958 

1915 TGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTG 1974 

I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

195 9 TGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTG 2018 

1975 GCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAG 2034 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2019 GCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAG 2078 

2035 TACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAG 2094 

I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I M I I I M M I I I I I I I I M I I I 

207 9 TACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAG 2138 

2095 CTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGAC 2154 

I I I M I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I M I I I I I I I I 
2139 CTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGAC 2198 

2155 AGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAG 2214 

I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

2199 AGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTT^G 22 58 

2215 CTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTAC 2274 

I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
2259 CTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTAC 2318 

2275 TTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAG 2334 

I M I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2319 TTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAG 237 8 

2335 CTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACC 2394 

I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M M I I I 
2379 CTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACC 2438 



Qy 2395 AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTG 24 54 

I I I I I I I Ml I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I M I I M M I I I I I 

Db 2439 AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTG 2498 

Qy 2455 GTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTG 2514 

I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 2499 GTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTG 2558 

Qy 2515 GACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGAC 2574 

M I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2559 GACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGAC 2618 

Qy 2575 AGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGG 2634 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 2619 AGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGG 2678 

Qy 2635 GAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTG 2694 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 2679 GAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTG 2738 

Qy 2695 GGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2739 GGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2796 
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AX367094 3580 bp DNA linear PAT 16-FEB-2002 

Sequence 13 from Patent WO0198354. 

AX367094 

AX367094.1 GI: 18855296 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Griff in, J. A. , Kallick, D . A. , Tribouley, C .M. , Yue,H., Nguyen, D.B., 
Tang,Y.T., Lai, P., Policky, J. L. , Azimzai,Y., Lu,D.A., Graul,R., 
Yao,M.G., Burford,N., Haf alia, A. J. , Baughn,M.R., Bandman,0., 
Patterson, C. , Yang, J., Xu,Y., Warren, B. A., Ding,L. and 
Sanjanwala,M. S . 
Receptors 

Patent: WO 0198354-A 13 27-DEC-2001; 
Incyte Genomics, Inc. (US) 

Location/Qualifiers 

1. .3580 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 
/note="Incyte ID No: 6052371CB1" 



ORIGIN 



Query Match 85.6%; Score 2356; DB 6; Length 3580; 

Best Local Similarity 93.6%; Pred. No. 0; 

Matches 2537; Conservative 0; Mismatches 5; Indels 168; Gaps 



1; 



Qy 43 GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 102 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 1 GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 60 

Qy 103 CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 162 

I I I I I I M I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 61 CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 120 

Qy 163 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 222 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 180 

Qy 223 CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 282 

I I M I I I I I I I I I I I I I I I I I I I I I I M I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 240 

Qy 283 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 342 

M I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 300 

Qy 343 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 402 

I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M M I I I I I I I 

Db 301 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 360 

Qy 403 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 4 62 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 361 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 420 

Qy 463 GCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 522 

I I I I I I I I I I I I I I I I I I I M I I I I M I M I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 421 GCCTACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 4 80 

Qy 523 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 582 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 481 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 540 

Qy 583 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 642 

I I I I I I M I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I M I I I i I I 

Db 541 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 600 

Qy 643 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTAC 702 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 601 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTAC 660 

Qy 703 ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTC 762 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTC 720 

Qy 763 TACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGG 822 

I I I I I I 

Db 721 TACGTG 726 

Qy 823 CGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCT 882 

Db 727 726 

Qy 883 TTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGC 942 



Db 727 GACGGC 732 

Qy 943 AGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 1002 

I I I I I M I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 733 AGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 792 

Qy 1003 CGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 1062 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 793 CGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 852 

Qy 1063 GACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTG 1122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I 

Db 853 GACACCCGCAACTGTACCAGTGACCTCTGTGTACACACTGCTTCTGGCCCTGAGGACGTG 912 

Qy 1123 GCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTC 1182 

I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I M I M I I I I I I I I I I I I I I I I I M I I I I 

Db 913 GCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTC 972 

Qy 1183 ATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT 1242 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I M I I I I I I I I 
Db 973 ATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT 1032 

Qy 1243 CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTG 1302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1033 CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTG 1092 

Qy 1303 CTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCC 1362 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I 
Db 1093 CTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCC 1152 

Qy 1363 CGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTG 1422 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 1153 CGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTG 1212 

Qy 1423 GGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTC 14 82 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 1213 GGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTC 1272 

Qy 1483 TCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACC 1542 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1273 TCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACC 1332 

Qy 1543 TATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTC 1602 

I I I I I I I I I I M I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1333 TATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTC 1392 

Qy 1603 CTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCAC 1662 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I 

Db 1393 CTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCAC 1452 

Qy 1663 AAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTT 1722 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M 
Db 1453 AAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTT 1512 

Qy 1723 AGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGT 1782 

I M I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1513 AGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGT 1572 

Qy 17 83 GGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGC 1842 

I I I I M I I I I I I I I I I I I M I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I 
Db 1573 GGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGC 1632 

Qy 1843 TGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTG 1902 

I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

Db 1633 TGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTG 1692 

Qy 1903 GAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAG 1962 

M I I I I I I I I I I I I I I I I I I M M I I I I I I I I I M I I I I I I I I I M I I I M I I I I I I I I I 

Db 1693 GAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAG 1752 

Qy 1963 GCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGC 2022 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I 

Db 1753 GCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGC 1812 

Qy 2023 ACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAG 2082 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 1813 ACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAG 1872 

Qy 2083 GAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTG 2142 

I 1 I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 1873 GAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTG 1932 

Qy 2143 CACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTG 2202 

I M I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1933 CACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTG 1992 

Qy 2203 TGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGC 2262 

I I I I I I I I I I M I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I M I 

Db 1993 TGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGC 2052 

Qy 2263 ACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGAC 2322 

I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2053 ACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGAC 2112 

Qy 2323 CTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAAC 2382 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2113 CTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAAC 2172 

Qy 2383 TTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 2442 

I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I 
Db 2173 TTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 2232 

Qy 2 443 GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 2502 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 2233 GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 22 92 

Qy 2503 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2562 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2293 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2352 

Qy 2563 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2 622 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2353 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2412 



Qy 2623 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2682 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 2413 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2472 

Qy 2683 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 2473 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2532 

Qy 2743 GGCCGGCCAG 2752 

I I I I I I I I I I 
Db 2533 GGCCGGCCAG 2542 



RESULT 5 

MMU487852 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



MMU487852 3992 bp mRNA linear ROD 24-SEP-2002 

Mus mus cuius mRNA for netrin receptor UncShl (UncShl gene) . 
AJ487852 

AJ487852. 1 GI: 22035783 

netrin receptor Unc5hl; Unc5hl gene. 

Mus mus cuius (house mouse) 

Mus mus cuius 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Engelkamp, D . 

Cloning of three mouse Unc5 genes and their expression patterns at 
mid-gestation 

Mech. Dev. 118 (1-2), 191-197 (2002) 
22239710 

12351186 

2 (bases 1 to 3992) 
Engelkamp, D . 

Direct Submission 

Submitted ( 15-MAY-2002 ) Neuroanatomy, Max Planck Institute for 
Brain Research, Deutschordenstrasse 46, Frankfurt 60528, GERMANY 

Location/Qualifiers 

1. .3992 

/organism="Mus musculus" 
/mo l_t yp e = "mRNA " 
/db_xref="taxon: 10090" 
1. .3992 
/gene="Unc5hl" 
232. .2928 
/gene="Unc5hl" 
/codon_start=l 

/product="netrin receptor UncShl" 
/protein_id="CAD32250. 1" 
/db_xref="GI: 22035784" 
/db_xref="GOA:Q8KlS4" 
/db_xref="SPTREMBL:Q8KlS4" 

/translation="MAVRPGLWPALLGIVLTAWLRGSGAQQSATVANPVPGANPDLLP 

HFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDGSSGLPTM 
EVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEV 
SLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLWRQARLADTAN 
YTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLN 



GGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 
RGADLDTRNCTSDLCLHTSSGPEDVALYIGLVAVAVCLILLLLVLVLIYCRKKEGLDS 
DVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQL 
SNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNFLGG 
RLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV 
LLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGAC 
YVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEW 
QLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGT 
QQYLHCTFTLERWASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAEMLALESEG 
GVPALVGPSAFKIPFLIRQKIITSLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPT 
AMI LNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC " 

ORIGIN 

Query Match 84.4%; Score 2323.2; DB 10; Length 3992; 

Best Local Similarity 90.3%; Pred. No. 0; 

Matches 2484; Conservative 0; Mismatches 268; Indels 0; Gaps 0; 
Qy 1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

I I I I I II I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I M II 

Db 187 CCGCCCGCGGGGCCCCTAGTCCAGCCCGCTCGTCCGCCCGCGGCCATGGCCGTCCGGCCC 24 6 

Qy 61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

II II I I M M I I I II I I I II I I I I I I I I I II I I I II I I I I I II II I I I I I I I I I 

Db 247 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCACTGCCTGGCTTCGTGGTTCGGGTGCC 306 

Qy 121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 18 0 

II I II I I I I I I I I I I I I I I I M I I I I M II I I II M I I I II I I II I M I I I I I I I I I 

Db 307 CAGCAGAGTGCCACAGTGGCCAACCCAGTGCCTGGTGCC/^CCCAGACCTGCTGCCCCAC 366 

Qy 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGTVACAAGCCAGTGCTGCTTGTGTGC 240 

II M II II I I I II Mill I I I I I I I I I M I I I I I I I I I M I M I I I I I I I M M 

Db 367 TTCCTGGTAGAGCCGGAGGACGTGTACATTGTCAAGAACAAGCCCGTGCTGCTGGTGTGC 426 

Qy 241 T^GGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

I I I I I I I I I I M I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 427 AAGGCTGTGCCCGCCACCCAGATCTTCTTCAAGTGCAACGGGGAATGGGTTCGCCAGGTC 486 

Qy 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

II Mill Mill I II I II M Mill Mill II I II I II II I M M II I II 

Db 487 GATCACGTCATCGAACGCAGCACTGACGGCAGCAGCGGATTGCCAACCATGGAGGTCCGG 546 

Qy 361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 42 0 

II II M M II I II II I I I II I II I I I II II I I II I I I II M I II II II II I I II 

Db 547 ATCAACGTATCAAGGCAGCAGGTCGAGAAAGTGTTTGGGCTGGAGGAGTACTGGTGCCAG 606 

Qy 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 4 80 

II M I II II II M I I I I II M II II I I I I II II II II M I II II II I I II III 
Db 607 TGTGTGGCATGGAGCTCCTCAGGAACCACCAAAAGCCAGAAGGCCTACATCCGGATTGCC 666 

Qy 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 540 

II II I M I M I I I I II I II I II I II I II II I II II M I I I II I II I I M I III 

Db 667 TATTTGCGCAAGAACTTTGAGCAGGAGCCGCTGGCCAAGGAAGTGTCACTGGAGCAAGGC 726 

Qy 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

M Mill II M II II Mill II II I I I Mill II II II II II II II II I 
Db 727 ATTGTGCTACCTTGTCGCCCCCCGGAAGGAATCCCCCCAGCTGAGGTGGAGTGGCTCCGA 786 



601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I 
787 AATGAGGACCTCGTGGACCCCTCCCTCGACCCCAATGTGTACATCACACGGGAGCACAGC 84 6 

661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 

II II I II I I M I I I I I I I II II I I I I I I I I I I I I M II I I M I I II I M I I I I I 

847 CTAGTCGTGCGGCAGGCCCGCCTGGCCGACACTGCCAACTACACCTGCGTGGCCAAGAAC 906 

721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 78 0 

I I I I I I I I I M I I I I I I I I I I I I I II II I I I I I II II I I I I I I I I I I I I I I I 
907 ATCGTGGCCCGTCGCCGAAGCGCCTCTGCGGCCGTCATTGTTTATGTG7\ACGGTGGGTGG 966 

781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 840 

I I I I I M I I I II I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I 
967 TCGACGTGGACCGAGTGGTCCGTCTGCAGTGCCAGCTGTGGGCGTGGCTGGCAGAAACGG 102 6 

841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 900 
I I I I I II I I I I I I I I I I M I M I I I I I I I I I I I I I I I I II M I I I M I I I I I I I II I I 
1027 AGCCGGAGCTGCACCAACCCGGCACCTCTCTUVCGGGGGCGCCTTCTGTGAGGGGCAGAAT 1086 

901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I 
1087 GTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGCTGGAGCCCATGGAGT 1146 

961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1147 AAGTGGTCAGCCTGCGGGCTTGACTGCACCCACTGGCGGAGCCGGGAGTGCTCCGACCCA 1206 

1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 1080 

II M I I M I I I M I I I I I I I II I I I I III I I I I I I I I I I I I M I II I I I I I I I M 

1207 GCGCCCCGCAACGGAGGTGAGGAGTGCCGGGGTGCTGACCTGGACACCCGCAACTGTACC 1266 

1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II 
1267 AGTGACCTCTGCCTGCACACCTCTTCCGGCCCCGAGGACGTGGCTCTCTACATCGGCCTC 1326 

1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 

I I I I I I I II I I I I I II I I II I I I I I I I I I I II I I I I I I I I I I II I I I I I 
1327 GTCGCCGTGGCCGTGTGCCTCATCTTGCTGCTGCTGGTCCTCGTCCTCATCTACTGCCGC 138 6 

1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 12 60 

I I I I I I I I II I I I I I I II I I I I M I II I I I I I I I I I I II I I I I I I I I I I M I I I 
1387 AAGAAGGAAGGACTGGACTCAGACGTGGCTGACTCATCCATCCTTACCTCAGGCTTCCAG 1446 

1261 CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 

M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

14 4 7 CCTGTCAGCATCAAGCCCAGCAAAGCAGACAATCCCCATCTGCTCACCATCCAACCGGAC 1506 

1321 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 138 0 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
1507 CTCAGCACCACCACGACCACCTACCAGGGCAGCCTGTGTCCCCGGCAGGATGGACCCAGC 1566 

1381 CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 1440 

I M I I I M I I I M I I I II I I I I I I I I I I I I I I I II I I I I I I I II I I I I I II 

1567 CCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGCAGTGGCCGCCATACG 162 6 
1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1627 CTGCACCACAGCTCCCCCACCTCTGAGGCTGAGGACTTCGTCTCCCGCCTCTCCACCCAA 1686 
1501 AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 

I I I I I I I I II II II I I II II II II II I I I M I I I I I I I I I I I I II II I I II I II 

1687 AACTACTTTCGTTCTCTGCCCCGCGGTACCAGCAACATGGCCTATGGGACCTTCAACTTC 1746 

1561 CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 1620 

II I I I II II II II I I I II II II II II II I I I I II I I I I I I I I I I I I II I I M III 
1747 CTCGGGGGCCGGCTGATGATCCCTAACACAGGAATCAGCCTCCTCATACCCCCGGACGCC 1806 

1621 ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCAC7\AGCCGGAAGACGTGAGG 168 0 

II II II II II I I I I I I I I I I I I I M I I II I I I I I I II I I I I I I I I I I II I I I I I I 
1807 ATCCCCCGAGGAAAGATCTACGAGATCTACCTCACTCTGCACAAGCCAGAAGACGTGAGG 1866 

1681 TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

I I I I M II I I I II I I I I I I M II I I M I I I I I I I I I I I II I I I I I M I I I I I I I II 
1867 TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCTATCGTTAGCTGTGGGCCCCCAGGA 192 6 

1741 GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 18 00 

I I I I I I I I M II I I I M M I I I I I I I II I I I M II I I M I I I I I I I I I M II III 

1927 GTCCTGCTCACCCGGCCAGTCATCCTTGCCATGGACCACTGCGGGGAGCCCAGTCCCGAC 1986 

1801 AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I II I 

1987 AGCTGGAGCCTGCGCCTCAAAAAGCAGTCCTGTGAGGGCAGCTGGGAGGACGTGCTGCAC 204 6 

18 61 CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 192 0 

II II I I I I I I I I I I I II II I M I I M I I I I I I I I I I I I I I I M I I I I I M I I 

2047 CTTGGTGAGGAGTCGCCCTCTCATCTCTACTACTGCCAGCTGGAGGCCGGGGCCTGCTAT 2106 

1921 GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I M I I 
2107 GTCTTCACCGAGCAGCTAGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 2166 

1981 GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2 040 

I I I I I I I I I I M III I I I I I I I I I I I II I I I I I I I I II M I I I I I I I I I I I I I 
2167 ACCAAGCGCCTCAGGCTCCTTCTGTTTGCCCCTGTGGCCTGTACGTCCCTCGAGTACAAC 2226 

2041 ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

I I I II II I I I I I I I I I I I M I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I II I 
2227 ATCCGAGTGTACTGCCTGCACGACACCCACGATGCTCTCAAGGAGGTGGTGCAGCTGGAG 22 86 

2101 AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

I I I I I I I I I I I I I I I I I M I I M I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I 
2287 AAGCAGCTGGGTGGACAGCTGATCCAGGAGCCCCGTGTCCTGCACTTCAAAGACAGTTAC 2346 

2161 CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

I I I I I I I I II II I I II I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I II I 
2347 CACAACCTACGTCTGTCCATCCACGACGTGCCCAGCTCCCTGTGGAAGAGCAAGCTCCTT 2406 

2221 GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 228 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I II I I I I III I I I I I 
2407 GTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAATGGCACTCAGCAGTATCTGCAC 2466 

2281 TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I II I M I I I I I I I I I I I I I I 



Db 



2467 TGCACCTTCACCCTGGAGCGCGTCAATGCCAGCACCAGCGACCTGGCCTGCAAGGTGTGG 252 6 



Qy 2341 GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 2400 

I I I I I I I I I M I I I I I I II II I M M I M I I I i II I I I I I II I I II I I II I I II 

Db 2527 GTGTGGCAGGTGGAGGGAGATGGACAGAGCTTCAACATCAACTTTAACATCACTAAGGAC 2586 

Qy 2401 ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 24 60 

II I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M 

Db 2587 ACGAGGTTTGCTGAAATGCTGGCTCTGGAGAGTGAAGGGGGGGTCCCAGCCCTGGTGGGC 2 646 

Qy 2461 CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

I I I I I I I I M M II I M I I I I M I I I I I I I I I I I I I I I I I III I I I I I M I I I I I I I 

Db 2647 CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATTACCAGCCTGGACCCA 2706 

Qy 2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 2580 

HIM II I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 2707 CCCTGCAGCCGGGGCGCCGACTGGCGAACTCTAGCCCAGAAACTTCACCTGGACAGCCAT 2766 

Qy 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I M I I I I II I 

Db 2767 CTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTCAACCTATGGGAGGCG 2826 

Qy 2 641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

I I I I I II I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 827 CGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCCGCAGCTGTGGCCGGACTGGGCCAG 288 6 

Qy 2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I II I I 
Db 2887 CCAGATGCTGGCCTCTTCACCGTGTCAGAGGCCGAGTGCTGAGACCAGCCAG 2938 
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BD057524 3014 bp DNA 

Netrin receptors. 

BD057524 

BD057524. 1 GI: 22603130 
JP 2001505062-A/l. 
synthetic construct 
synthetic construct 
artificial sequences. 
1 (bases 1 to 3014) 
Lavigne^M. T. , Leonardo, D. E 
Netrin receptors 

Patent: JP 2001505062-A 1 17-APR-2001; 
THE REGENTS OF THE UNIV OF CALIFORNIA 



linear PAT 27-AUG-2002 



Hinck^L., Masu,M. and Masu^K.K. 



COMMENT PN JP 2001505062~A/1 

PD 17-APR-2001 

PF 19-FEB-1998 JP 1998536840 

PR 19-FEB-1997 US 08/808982 

PI MARC TESSIER LAVIGNE, DAVID E LEONARDO, LINDSAY HINCK,MASAYUKI 

PI MASU, 

PI KAZUKO KEINO MASU 

PC C07Kl/00,C07K14/00,C07K17/00,C07H21/02,C07H21/04,G01N33/53 CC 

Strandedness : Double; 

CC Topology: Linear; 

FH Key Location/Qualifiers. 



FEATURES Location/Qualifiers 
source 1. .3014 

/ organism="synthetic construct" 
/mol_type=" genomic DNA" 
/db_xref="taxon:32630" 

ORIGIN 

Query Match 82.1%; Score 2259; DB 6; Length 3014; 

Best Local Similarity 89.7%; Pred. No. 0; 

Matches 2427; Conservative 0; Mismatches 280; Indels 0; Gaps 0; 

Qy 4 6 ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 1 ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 

Qy 106 CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

II II I I I I I II I II I I I I II I I I I I II I I I I II I I I I I I I I I I I II I I I I I I II 

Db 61 CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 120 

Qy 166 GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

II II M II I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I 11 I I I I I 

Db 121 GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 180 

Qy 226 GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 285 

III II I I II I I I II I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 181 GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 240 

Qy 2 86 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

I I I I I I I I I I I I I II I I I II II M I I I I I I I I III I I II I I II I I I I 
Db 241 TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 300 

Qy 346 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGT^GGTGTTCGGGCTGGAG 405 

I II I I I I I I I I II I II M II II I I I I I I I M I I I I I I I I I I I I I I M I I I I I 
Db 301 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 360 

Qy 406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 4 65 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I M I I I II I I I I I I I I I I I II 

Db 361 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 420 

Qy 466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

I I I I I I I I II III I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I M I I Ml 

Db 421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 480 

Qy 52 6 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

II I I I I I II I I I I I I II II II II II II M II II I I II I I I I I I III 

Db 481 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 540 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

I I M I I I I I I I II II I I I I I I M I II I M M I I I I I II I I I M I I I I M I I I 

Db 541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 600 

Qy 646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 7 05 

I I I M I M I I I I I M I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I 

Db 601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 660 

Qy 706 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

II I I I I I I I I I I I I I I I I I II I I M I I I I III I I I I II M I I I I I II II 



Db 


661 


TGTGTGGCCAAGT^CATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 


720 


Qy 


766 


GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 
GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 


825 


Db 


721 


780 


Qy 


826 


GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 III 

GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 


885 


Db 


781 


840 


Qy 


886 


TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 

1 1 M 1 1 M 1 M 1 II 1 II 1 1 1 II 1 II 1 1 II 1 1 II 1 1 1 II 1 M 1 II M III II II III 

TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 


945 


Db 


841 


900 


Qy 


946 


TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 

Mill 1 1 1 1 1 1 1 1 1 II 1 M 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 


1005 


Db 


901 


960 


Qy 


1006 


GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 III 1 1 1 1 1 1 1 1 M 1 

GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 


1065 


Db 


961 


1020 


Qy 


1066 


ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 

M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 


1125 


Db 


1021 


1080 


Qy 


1126 


CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 


1185 


Db 


1081 


1140 


Qy 


1186 


CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 

Ml 1 II 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 III 

CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 


1245 


Db 


1141 


1200 


Qy 


1246 


ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 

Mill 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M M 1 1 1 M M 1 1 1 1 1 II II II 1 II M 1 11 M 1 1 

ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 


1305 


Db 


1201 


1260 


Qy 


1306 


ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 

M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 III 1 II 
ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 


1365 


Db 


1261 


1320 


Qy 


1366 


CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 

1 M 1 1 1 1 1 II 1 M 1 II II M 1 1 1 II M II 1 1 1 1 II 1 1 M 1 1 1 1 1 II 1 1 1 II II 1 

CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 


1425 


Db 


1321 


1380 


Qy 


1426 


GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 

1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 
AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 


1485 


Db 


1381 


1440 


Qy 


1486 


CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 

1 1 1 1 I 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 t r 1 1 1 > 

i 1 1 1 1 1 1 i 1 1 1 M 1 1 j 1 i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II II 1 1 1 1 MM 
CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 


1545 


Db 


1441 


1500 


Qy 


1546 


GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 
M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II M 1 M 1 II 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 
GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 


1605 


Db 


1501 


1560 



1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

M Mill MIIIMI IIIIIIM llllllll llllllllllllll lllllllll 

1561 ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 1620 
1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

II I N I I 11 I II I II I I II II II II M M II M I II I M II I I I I II II llllllll 

1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I I I I I Mill II I I I I I I I I I I I I II I I I I I I I I I I I I II llllllllllllll 
1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 174 0 

178 6 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

M I II I I I I I I I I I I I I I M I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I III 
1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 

1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I M I M I I I I M I I I I I II I I I I I I I II I I M I M I I I I I I I I M I I I I I M I I I 

18 01 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 18 60 

1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

Ml I llllllll llllllll I I I M I I I I I I I I I I I I I M I I I I I I lllllllll 
1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

I II M M I I I I I I I I I I I I I I I I 11 I I III I I I I I I I M I I II llllllll II 

1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 1980 

2 026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

Mill Mill I II llllll Illlllllll II lllllllll 

1981 TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 

2 086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

I M II I I I I I I I I I I I I I I I I I I M I M I I I I I I I M I I I I I M I II I I M I I I I I 

2041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

2146 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

Mill I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2101 TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 2160 

2206 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

Mill I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II I I I 

2161 AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 2220 

2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

MM III I I I I I I I I I I I I I M I I I I I I I I I I III I I I I II I I II llllll 
2221 CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 2280 

2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

M M II M I I I I I M I II I I I II I I I I I I I I II I I I I I II I I I I I I Illlllllll 

2281 GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 2340 

238 6 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

MIIIMI II M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
2341 AACATCACTAAGGACACTSiAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 2400 



Qy 


2446 


Db 


2401 


Qy 


2506 


Db 


2461 


Qy 


2566 


Db 


2521 


Qy 


2626 


Db 


2581 


Qy 


2686 


Db 


2641 


Qy 


2746 


Db 


2701 



CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I II 

CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 
TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 

MM MIIIIIMMMI M Mill MIIMI M IMIIIMIII 

GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 

CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 

M M II II II M I M I I II I I I I I I I I I I I II II I I I I I I I I I I I II I I II I I I I I II 

CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 
AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 

Mill IIIMIM llllllllllllllllllllllll IMIIIIIII Mill III 

AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 



GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 

M II II II I II II II I II I I I M II II M I I I II M I I I M M I II I I 

GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGAGAC 



I Mill 



2505 
2460 
2565 
2520 
2625 
2580 
2685 
2640 
2745 
2700 
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AX268596 
Sequence 15 
AX268596 
AX268596.1 



2697 bp DNA 
from Patent WO0175440. 

GI:16541710 



linear PAT 29-OCT-2001 



Rattus sp. 

Rattus sp. . X. • 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodent ia; Sciurognathi; Muridae; Murinae; 
Rattus . 
1 

Cochran, S.W., Paterson, G. Y. , Ohashi^Y.W., Morris, B.Y. and 
Pratt, J. Y. 

Schizophrenia related genes 

Patent: WO 0175440-A 15 ll-OCT-2001; 

WELFIDE CORPORATION (JP) 

Location/Qualifiers 

1. .2697 

/organism="Rattus sp." 
/mol_type="unassigned DNA" 
/db xref="taxon: 10118" 



ORIGIN 



Query Match 81.8%; Score 2252.2; DB 6; Length 2697; 

Best Local Similarity 89.7%; Pred. No. 0; . ^ ^ n. 

Matches 2419; Conservative 0; Mismatches 278; Indels 0; Gaps U, 



Qy 

Db 



105 



4 6 ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 

M II II II I I II I II I II I M I I M I I I II M M M II M M II II I II II I Mill 

1 ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 



106 CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

I II iiiiiiiiiiiiiiiii 'ii'L'i.liliii.iiii!""" 

61 



CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 12 0 



166 GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

llllllll IIIIIIIIIIIIM Mill Mill IIIIIMI Mil " 

121 GACC 



TGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 180 



226 GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 285 

III II I IMIIIIIIll Mill Mill IIIIIIIIIIIIIIIMIII Mill 
181 GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 240 

286 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

Mill II I I II I Ill II II I MM III III I MM! 1 

241 TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 300 



346 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 

IMIIIMMMM II II II M IMIIIIIIll Mill Mill IMIIIIII 

301 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 
406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 

I I II II II I M II II II II I I II I II II I II I I II I I I M I I I I I M MM 

361 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 
4 66 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

IMIIIII II III IIMIMIMM IMII llllllll III 

421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 



405 
360 
465 
420 



480 
585 
540 
645 

541 GTGGAGTGGCTTCGAAATGAGGAC^ 600 
646 



526 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 

II llllllll II I I M II II M Mill Mill Ml 

481 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 
586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 

IIIIIIIMII 11 II I II llllllll Mill II IIIMIII MUM 



601 
706 
661 
766 



ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 

IIIIIIIIIIIIIIIII II I I I I I I I I I I I II I I I II I M II II I II II I I M II 
ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 

TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 

II IIIIIIIIIIIIIIIII II II Ml MM II II Mill II II 

TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 



GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 

IMIIMMMMIMIMIMIMI MMIMMIMMIIM Ill 

721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 



826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 

I M II I II M I M II I! I M M II II I 11 I I I I I M M M II I I I II II M M M III 

GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 



781 



705 
660 
765 
720 
825 
780 
885 
840 



886 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

I M I I I I II I M II I I I 1 I I II I I II I II I I I I M I II M _M_I I Jii 

841 TGI 



STGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 900 



946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 

I I I I I I I I I I I I I I I I I II II MM II 

901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 



1005 

960 
1065 



1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 

IIMIIilllillilllillliliil Ml Mil I I Ml I Mi I I 

9 61 GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 1020 
1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

M M II M M M M M I M I I II II I I I I I I MUM I I I II 1 II I I 

1021 ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 1080 



1126 



1185 



CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 

Mill I Mill II I I I II I I I I I I I I I M Mill III 

1081 CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 



1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

III III M II I II II I M I II I II I I I I I II I II I I II II II I Ml 

1141 CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 



1200 



1246 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 

I II I I II I II II 11 I I M I II I M I II II I I M II I M I II II M I I M II 

1201 ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 



1306 



1260 
1365 



ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 

I I I I I II II II I II II II II M I II I II II II II I I M I I I II Ml I II 

1261 ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 1320 

1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

I 111 MM IIIIIIIIMIIMIIMIII I Mill IIIIIMIIMIII Mill 
1321 CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 1380 



1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 

I I II II I II II II 1 II II 1 1 I M 1 I I II II I I II M M M 

1381 AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 
1486 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 

IIIMIMMMM IMIIIM M IMMIMIM MMMMIMMM MM 

1441 CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 
1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 

II MMMMMMMMMMMMIMIMMMI MMIIIIM 

1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 



1485 
1440 
1545 
1500 
1605 
1560 



1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

II I I II I MM Mill II II M 11 I 1 1 1 M 11 M I M 1 M 1 II I II I 

ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 



1561 



1620 
1725 



1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 
II lllllllllllllllllllMllllllllllllllllllllllllM IMIIIM 
1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I II II II II I II I M 1 I I II 1 M M 11 1 1 M MM I M II II 1 II II 1 I II I II 

1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 1740 



1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 



Db 


1741 


1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 IIMIII MM MMIMI 1 

GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 


1 Q n ri 


Qy 


1846 


GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 

1 M 1 M M M 1 M M M M M M M 1 M M M 1 M M M M M M M M M M M 

GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 


1 n r\ c 


Db 


1801 


1 o 


Qy 


1906 


GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 

Ml 1 1 1 M M M 1 M M M 1 M M M M M M M M M M M M M M M M M 1 

GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 


19 DO 


Db 


1861 


1 A o n 
19zU 


Qy 


1966 


CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 

1 1 1 1 M 1 M M M M M M M Ml M M M M 1 M M MMIMI II 

CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 


2025 


Db 


1921 


1980 


Qy 


2026 


TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 

Mill 1 II II 1 M M II II M MMIMI II 1 II II II 1 M 1 II II II II 1 M 

TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 


2085 


Db 


1981 


2040 


Qy 


2086 


GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 

MM II 1 1 M II M M II II 1 II 1 1 II II II 1 II 1 II 1 II 1 M II 1 

GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 


2145 


Db 


2041 


2100 


Qy 


2146 


TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 

Mill M II II II II 1 II 1 II 1 II M 1 II II 11 II II 11 M II 1 II 11 II II 1 II 

TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 


2205 


Db 


2101 


2160 


Qy 


2206 


AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 

Mill 1 1 11 1 1 1 1 II II II 11 II II 1 1 II II II II 1 II II 1 M 11 11 II 1 II 1 1 

AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 


2265 


Db 


2161 


2220 


Qy 


2266 


CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 

IIMIII II II 1 II II 1 1 M II II II 1 1 II II 1 1 1 1 1 II II II II 1 M 1 1 1 
CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 


2325 


Db 


2221 


2280 


Qy 


2326 


GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 

MM II 1 1 1 II II II II M 1 II 1 II II 1 II II 1 II II 1 M II 1 1 M II 

GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 


2385 


Db 


2281 


234 0 


Qy 


2386 


AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 

MMIMI II II II II II II II II M 1 1 II 11 II 1 1 M II II II II 11 II II M 1 

AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 


2445 


Db 


2341 


2400 


Qy 


2446 


CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 

M M 1 11 II 1 II II II II 11 II 11 II M 1 II 11 1 1 II II 11 II 1 M M II II II 1 M 

CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 


2505 


Db 


2401 


24 60 


Qy 


2506 


TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 

1 1 II II II II 1 II II 1 M 11 MM! II M 1 II 11 1 Mill M 1 II 11 II II 

GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 


2565 


Db 


2461 


2520 


yy 


2566 


CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 

II II II M II M II II 1 II 1 11 II II 1 1 M 11 1 1 II 1 II 1 II 1 1 II II 1 II II M II 1 

CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 


2625 


Db 


2521 


2580 


Qy 


2626 


1 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 

Mill II M 1 1 II II M II M 1 M 1 1 II II II II II 1 1 II II 1 II M Mill Ml 


2685 



Db 



2581 AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 2640 



Qy 2 68 6 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml 
Db 2641 GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGA 2697 
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CDS 



RNU87305 2697 bp itiRNA linear ROD 15-MAY-1997 

Rattus norvegicus transmembrane receptor UncSHl mRNA, complete cds . 
U87305 

U87305. 1 GI:2055391 

Rattus norvegicus (Norway rat) 

Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (bases 1 to 2697) 

Leonardo, E. D. , Hinck^L., Masu,M. , Keino-Masu, K. , Ackerman, S , L . and 
Tessier-Lavigne,M. 

Vertebrate homologues of C. elegans UNC-5 are candidate netrin 

receptors 

Nature 386 (6627), 833-838 (1997) 

97271897 

9126742 

2 (bases 1 to 2697) 

Leonardo, E. D. , Hinck^L., Masu,M. , Keino-Masu, K. and 

Tessier-Lavigne,M. 

Direct Submission 

Submitted (28- JAN~1997 ) Anatomy, UCSF, 513 Parnassus, San 
Francisco, CA 94143-0452, USA 

Location/Qualifiers 

1. .2697 

/organism="Rattus norvegicus" 
/mol_type="mRNA" 
/db_xref="taxon: 10116" 

/tissue_type="brain and ventral spinal cord" 
/dev_stage="18 day embyo and 13 day embryo" 
1. .2697 
/ codon_start=l 

/product=" transmembrane receptor UNC5H1" 
/protein_id-"AAB57678 ,1" 
/db_xref="GI:2055392" 

/translation="MAVRPGLWPVLLGIVLAAWLRGSGAQQSATVANPVPGANPDLLP 
HFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDSSSGLPTM 
EVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEV 
SLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLWRQARLADTAN 
YTCVAKNIVARRRSTSAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLN 
GGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 
RGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCRKKEGLDS 
DVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPSPKFQL 
SNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNFLGG 
RLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPWSCGPPGV 
LLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGAC 
YVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEW 



QLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGT 
QQYLHCTFTLERINASTSDIACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEG 
GVPALVGPSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPT 
AMI LNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVS EAEC " 

ORIGIN 

Query Match 81.8%; Score 2252.2; DB 10; Length 2697; 

Best Local Similarity 89.7%; Pred. No. 0; 

Matches 2419; Conservative 0; Mismatches 278; Indels 0; Gaps 0; 



Qy 

Db 


46 
1 


ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 


105 
60 


Qy 


106 


CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 

II II 1 II 1 1 1 II 1 1 1 M 1 1 1 1 1 II 1 M 1 1 1 1 1 1 M 1 1 II 1 1 1 1 II 1 II II 1 II 1 

CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 


165 


Db 


61 


120 


Qy 


166 


GACCTGGTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 

II M II II 1 M II 1 II M 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II M 1 II 1 II II 1 1 1 Ml 

GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 


225 


Db 


121 


180 


Qy 


226 


GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 

III II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 
GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 


285 


Db 


181 


240 


Qy 


286 


TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 

1 1 1 1 1 1 1 1 M 1 1 1 II II 1 1 1 II II 1 M 1 1 1 1 1 Ml 1 1 II II II MM 
TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 


345 


Db 


241 


300 


Qy 


346 


ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 

II 1 1 1 II 1 1 II 1 II II II M II 1 II 1 II 1 II II II II 1 Mill 1 1 1 II 1 II 1 

ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 


405 


Db 


301 


360 


Qy 


406 


GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 

II 1 II II II M II 1 II 1 1 II 1 1 II II II II II M II 1 1 1 II II II 11 1 M II II 1 M 
GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 


465 


Db 


361 


420 


Qy 


466 


TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 

II 1 1 1 II 1 M III 1 II II 1 II M II II II II M 1 1 II 1 II II 1 II 1 1 II III 

TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 


525 


Db 


421 


480 


Qy 


526 


TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 

M II II M M II II 1 II II M M II II M M M Mill Mill III 

TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 


585 


Db 


481 


540 


Qy 


586 


GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 

II II II 1 II M II II 1 II II 1 1 1 II II II 1 1 1 1 1 1 1 II II II 1 II 1 1 II II 1 

GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 


645 


Db 


541 


600 


Qy 


646 


ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 

II II II II M II II 1 M M II II 1 M II II II II 1 M II II II M II II II 1 II 1 

ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 


705 


Db 


601 


660 


Qy 


706 


TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 

II 1 1 1 1 II 1 II II II II M II 1 1 II 1 II 1 III 1 1 1 1 II II Mill M II 

TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 


765 


Db 


661 


720 



Qy 


766 


Db 


721 


Qy 


826 


Db 


781 


Qy 


886 


Db 


841 


Qv 


946 


Db 


901 


Qv 


1006 


Db 


961 


Qv 


1066 


Db 


1021 


Qv 


1126 


Db 


1081 


Qv 


1186 


Db 


1141 


Ov 


1246 


Db 


1201 


Qv 


1306 


Db 


1261 


Qv 


1366 


Db 


1321 


Qv 


1426 


Db 


1381 


Qy 


1486 


Db 


1441 


Qy 


1546 


Db 


1501 



GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I 
GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 780 

GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 8 85 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I III 
GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 840 

TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

II II I I I I I II I M I II I I I I I I I I II M I I I I I I I I I I M I I I I I I I I II II III 

TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 900 

TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I 
TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 960 

GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

II II II I I II II I I I M I I I I I I I I I I II I I I I I I I I I I I III I I I I I I I I I I I 
GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 1020 

ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

I II I I I I I I I I I II I I I I I I I I I I I I I INI I I I I I I I I I I I I I I I I I I I I I 
ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 1080 

CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

II I II I I I I I I I I I I M I II I I I I I I I I I I I II I I I I I I I I I 
CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 

CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

III I I I I II II I I I I I I I I II I I I I I I I I I I M I I I I I I I II I I I I I I I III 
CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 1200 

ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 

I II II M I I I M I I I I I I I I I I I I I I I I I M I II I I I I I II I I I I I I M I I II I I I I 
ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 1260 

ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I III I II 
ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 1320 

CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

II I I I I M M I I I I I I M I I I I I I M I II I Mill II I I I I II II II II II I II 

CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 1380 

GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I I M I M II II II II II II I II I I II I II I M II I I I I Mill II II I I II I 
AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 1440 

CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I M M I I II M II I II I I II I II M M I M M II I M M I II M M II I I I I I 

CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 1500 
GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I I I I I I I II M I M I II I I I II I M II I II M M I I I I I I I I II II II M II II I I M 
GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 1560 



Qy 


1606 


Db 


1561 


Qy 


1666 


Db 


1621 


Qy 


1726 


Db 


1681 


Qy 


1786 


Db 


1741 


Qy 


1846 


Db 


1801 


Qy 


1906 


Db 


1861 


Qy 


1966 


Db 


1921 


Qy 


2026 


Db 


1981 


Qy 


2086 


Db 


2041 


Qy 


2146 


Db 


2101 


Qy 


2206 


Db 


2161 


Qy 


2266 


Db 


2221 


Qy 


2326 


Db 


2281 


Qy 


2386 


Db 


2341 


Qy 


2446 



ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 

M I I I I I M I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I 

ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 
CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 

II I II M I I I I I I I I I I I I I II M I I I I I II II I I II M I I M I I I M I I M I I I I I 

CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 

TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 

II I II II III II I I I I I I M I I II I I I I M I I I I I I II II I I I I I I II I I I I II 
TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 



1665 



1620 



1725 



1680 



1785 



1740 



1845 



GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 

I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I M I I II I I II I I I I I I I I I I I III 
GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 

GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I II I I I II II M I II I II I I I I I I I II I I I I I I I I I I I M I I I I II M I I I I I I 
GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 1860 

GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

III I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2 025 

I I I I I I II I I M I I I I I I I I I I I I I I I III I I I I I I I I I I I II 11 I II I II II 
CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 1980 

TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

I I I I I I I I I I I I I I I I I I I II I I I I I II I II M I I I I I I I II II I I I I I I I I I 
TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 

GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 214 5 

I I I I I I II I I I I I I I I I II I M I I I I I I I I I I M I M I II I I I I II I I I 

GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

I I I I I 1 I I I I I I I I I I I I I I I I II II I I I I I I I I I II I M I I M I I I I I I I II I I 
TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 2160 

AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

I I I I I Mill I I I I II I M I I I II I I I I I I I I I Mill M II I II I I M I I II I 
AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 222 0 



CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 

I II I III I I I I II II I M I M II I II II I M I III I I II I II I M I I I II I 

CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 



2325 



2280 



GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

M I I I I I M M I I M I M M I I I I I I I I I I I II II I II I I I I I I I I M I I I M I I I 

GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 2340 
AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

II II M M I I M I M I I II II II I II II II I I M I II I I I II I II M I I I M II I 

AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 24 00 
CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 



Db 



2401 



2460 



Qy 2506 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAG7WVCTC 2565 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I M I M 

Db 24 61 GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 252 0 

Qy 2566 CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2625 

I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I 1 I M I I I I I I I I I I I I I I I I I I I I I 
Db 2521 CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 2580 

Qy 2626 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2685 

I I I I I I I I I I I I I I I I I I M I I I I M I I M I I I I I I I I I I I I I I M I I I I I I III 

Db 2581 AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 2640 

Qy 2686 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

II II I I II I II I I I I I I I I I I M I I I I I I I M I II I I I I I II I I I I II III 

Db 2641 GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGA 2697 



RESULT 9 
BC058084 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 

PUBMED 
REFERENCE 



BC058084 3844 bp mRNA linear ROD 21-OCT-2003 

Mus musculus unc-5 homolog A (C. elegans), mRNA (cDNA clone 
MGC: 66671 IMAGE : 68134 63 ) , complete cds . 
BC058084 

BC058084.1 GI: 34784158 
MGC. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

1 (bases 1 to 3844) 

Strausberg,R.L. , Feingold, E. A. , Grouse, L.H. , Derge, J.G. , 
Klausner, R.D. , Collins, F. S . , Wagner, L., Shenmen, CM. , Schuler, G. D. , 
Altschul, S.F» , Zeeberg,B., Buetow,K.H., Schaef er, C . F . , Bhat,N.K., 
Hopkins,R.F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko,L. , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton,M. , Soares,M. B. , Bonaldo^M. F. , Casavant, T. L. , 
Scheetz,T.E. , Brownstein, M. J . , Usdin,T.B. , Toshiyuki, S. , 
Carninci,P., Prange,C., Raha,S.S., Loquellano, N . A. , Peters, G. J., 
Abramson,R.D. , Mullahy, S . J. , Bosak,S.A., McEwan,P.J., 
McKernan,K. J. , Malek,J.A., Gunaratne, P.H. , Richards, S., 
Worley,K.C., Hale,S., Garcia, A. M. , Gay,L.J., Hulyk,S.W., 
Villalon,D.K. , Muzny,D.M., Sodergren, E . J . , Lu,X., Gibbs,R.A. , 
Fahey,J., Helton, E., Ketteman,M., Madan,A. , Rodrigues, S . , 
Sanchez, A., Whiting, M., Madan,A. , Young,A.C., Shevchenko, Y. , 
Bouf fard, G.G. , Blakesley, R. W . , Touchman, J.W. , Green, E.D. , 
Dickson,M.C. , Rodriguez, A. C . , Grimwood,J., Schmutz,J., Myers, R.M., 
Butterfield,Y.S. , Krzywinski,M. I . , Skalska,U. , Smailus,D.E. , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 
22388257 

12477932 

2 (bases 1 to 3844) 
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Strausberg, R. 
Direct Submission 

Submitted (08-SEP-2003) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MB 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 

cDNA Library Preparation: M. Bento Scares, University of Iowa 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

EC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc.bc. ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butter field, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein^ Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 126 Row: b Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 23346570. 

Location/Qualifiers 

1. .3844 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone="MGC: 66671 IMAGE : 6813463" 

/tissue_type="Brain, mouse, 13.5, 14.5, 16.5, 17.5 dpc" 

/ cl one_l ib= " N I H_BMAP_FW 0 " 

/lab_host="DH10B" 

/note="Vector: pYX-ASC" 

1. .3844 

/gene="Unc5a" 

/db_xref-"LocusID: 107448" 

/db_xref="MGI : 894682" 

252. .2780 

/codon__start=l 

/product="Unc5a protein" 

/protein_id="AAH58084 . 1" 

/db_xref="GI: 34784159" 

/db_xref="LocusID: 107448" 

/translation="MAVRPGLWPALLGIVLTAWLRGSGAQQSATVANPVPGANPDLLP 
HFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDGSSGLPTM 
EVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEV 
SLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLWRQARLADTPN 
YTCVAKNIVARRRSASAAVIVYVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGE 
ECRGADLDTRNCTSDLCLHTSSGPEDVALYIGLVAVAVCLILLLLVLVLIYCRKKEGL 
DSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKF 



misc feature 



misc feature 



misc feature 



misc feature 



QLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNFL 

GGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 

GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAG 

ACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKE 

WQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWN 

GTQQYLHCTFTLERVNASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAEMLALES 

EGGVPALVGPSAFKIPFLIRQKIITSLDPPCSRGADWRTLAQKLHLDSHLSFFASKPS 

PTAMI LNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC " 

714. .926 

/note="IG; Region : Immunoglobulin" 
/db_xref="CDD: smart00409" 
984. .1130 

/note="TSPl; Region: Thrombospondin type 1 repeats" 

/db_xref="CDD:smart00209" 

1566. .1877 

/note="ZU5; Region: Domain present in ZO-1 and Unc5-like 
netrin receptors" 
/db_xref="CDD:smart00218" 
2496. .2684 

/note="DEATH; Region: DEATH domain^ found in proteins 
involved in cell death (apoptosis) . Alpha-helical domain 
present in a variety of proteins with apoptotic functions. 
Some (but not all) of these domains form homotypic and 
heterotypic dimers" 
/db xref="CDD:smart00005" 



ORIGIN 



Query Match 72.1%; Score 1983.6; DB 10; Length 3844; 

Best Local Similarity 84.3%; Pred. No. 0; 

Matches 2320; Conservative 0; Mismatches 264; Indels 168; Gaps 



1; 



Qy 


1 


Db 


207 


Qy 


61 


Db 


2 67 


Qy 


121 


Db 


327 


Qy 


181 


Db 


387 


Qy 


241 


Db 


447 


Qy 


301 


Db 


507 



1 1 1 1 



II I 1 1 1 1 1 1 1 



I I I I I I II I M I I II I I I I II I M II II 



I II I I I II M I II I I I I M I II I I I I I I 



I I 



I II I I II I II M I I I I I I I 



CAGCAGAGTGCCACCGTGGCCTyVCCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 

I I II I I I I I I I I M I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAGCAGAGTGCCACAGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTGCCCCAC 

TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 

I I II II I I I I I II I I II I I I I I I I I I I I I I I I I I II I M I I M I I 11 I I I I I M 

TTCCTGGTAGAGCCGGAGGACGTGTACATTGTCAAGAACAAGCCCGTGCTGCTGGTGTGC 

AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
AAGGCTGTGCCCGCCACCCAGATCTTCTTCAAGTGCAACGGGGAATGGGTTCGCCAGGTC 



180 



386 



240 



446 



300 



506 



360 



GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 

II I I I I I II II I I M I I I I Mill I I I I I II I I II I I I I I I II I I II I I 
GATCACGTCATTGAACGCAGCACTGACGGCAGCAGCGGATTGCCAACCATGGAGGTCCGG 566 



Qy 



361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 
II II II I I I II I I I II I II I I I I I I I II I I I I I M I I I I I I I I M I I I II I I I I 



Db 


567 


ATCAACGTATCAAGGCAGCAGGTCGAGAAAGTGTTTGGGCTGGAGGAGTACTGGTGCCAG 


626 


Qy 


421 


TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II 1 M II 1 1 M 1 1 1 1 II 1 1 1 1 1 II III 
TGTGTGGCATGGAGCTCCTCAGGAACCACCAAAAGCCAGAAGGCCTACATCCGGATTGCC 


480 


Db 


627 


686 


Qy 


481 


AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 

II 1 1 1 1 1 1 1 II 1 1 1 M M 1 1 1 1 M 1 1 II 1 1 M 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 III 

TATTTGCGCAAGAACTTTGAGCAGGAGCCGCTGGCCAAGGAAGTGTCACTGGAGCAAGGC 


540 


Db 


687 


746 


Qy 

Db 


541 
747 


ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 

II 1 1 1 M II II M II Mill II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 

ATTGTGCTACCTTGTCGCCCCCCGGAAGGAATCCCCCCAGCTGAGGTGGAGTGGCTCCGA 


600 
806 


Qy 


601 


AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 

II 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AATGAGGACCTCGTGGACCCCTCCCTCGACCCCAATGTGTACATCACACGGGAGCACAGC 


660 


Db 


807 


866 


Qy 

Db 


661 
867 


CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 

II II 1 1 II 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 
CTAGTCGTGCGGCAGGCCCGCCTGGCCGACACTCCCAACTACACCTGCGTGGCCAAGAAC 


720 
926 


Qy 


721 


ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 

1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 II II 1 1 1 II II II III 

ATCGTGGCCCGTCGCCGAAGCGCCTCTGCGGCCGTCATTGTTTATGTG 


780 


Db 


927 


974 


Qy 


781 


TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 


840 


Db 


975 




974 




Qy 


841 


AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 


900 


Db 


975 




974 




Qy 


901 


GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 

II II 1 1 1 1 1 1 1 1 1 1 1 Mill 


960 


Db 


975 


998 


Qy 


961 


AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 

1 M 1 1 M 1 II 1 II II II 1 1 II II II II 1 1 M 1 1 1 1 1 1 II M II II 1 1 II Mini 

AAGTGGTCAGCCTGCGGGCTTGACTGCACCCACTGGCGGAGCCGGGAGTGCTCCGACCCA 


1020 


Db 


999 


1058 


Qy 


1021 


GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 11 1 1 
GCGCCCCGCAACGGAGGTGAGGAGTGCCGGGGTGCTGACCTGGACACCCGCAACTGTACC 


1080 


Db 


1059 


1118 


Qy 


1081 


AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 

1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGTGACCTCTGCCTGCACACCTCTTCCGGCCCCGAGGACGTGGCTCTCTACATCGGCCTC 


1140 


Db 


1119 


1178 


Qy 


1141 


ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 

M 1 1 1 1 t 1 1 1 t 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 II 1 II 1 i 
1 1 1 11 1 1 1 1 11 1 1 II 1 1 1 II 1 1 1 1 i 1 1 1 1 1 1 1 1 I 1 1 Mill 1 II 1 1 1 1 1 

GTCGCCGTGGCCGTGTGCCTCATCTTGCTGCTGCTGGTCCTCGTCCTCATCTACTGCCGC 


1200 


Db 


1179 


1238 


Qy 

Db 


1201 
1239 


AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGAAGGAAGGACTGGACTCAGACGTGGCTGACTCATCCATCCTTACCTCAGGCTTCCAG 


1260 
1298 



Qy 


1261 


Db 


1299 


Qy 


1321 


Db 


1359 


Qy 


1381 


Db 


1419 


Qy 


1441 


Db 


1479 


Qy 


1501 


Db 


1539 


Qv 


1561 


Db 


1599 


Qy 


1621 


Db 


1659 


Qv 


1681 


Db 


1719 


Qv 


1741 


Db 


1779 


Qy 


1801 


Db 


1839 


Qy 


1861 


Db 


1899 


Qy 


1921 


Db 


1959 


Qy 


1981 


Db 


2019 


Qy 


2041 


Db 


2079 



CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 

II I II I I I I I II I I II I I II I II II II I I I I II II I II I I I I II I I I I II I I I I I I I 
CCTGTCAGCATCAAGCCCAGCAAAGCAGACAATCCCCATCTGCTCACCATCCAACCGGAC 1358 

CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 1380 

I II II I II II II M I I I II II II I II M I II M I I I I I I I I I I I II I II I I I I I I I 
CTCAGCACCACCACGACCACCTACCAGGGCAGCCTGTGTCCCCGGCAGGATGGACCCAGC 1418 

CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 1440 

I I M II II I I I I II I I I I II I I II I I I I II I II II I I I II I I I I I I II I II 
CCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGCAGTGGCCGCCATACG 1478 

CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

I I I II I I I I I II II I II II I M I I I I II Mill M I I I II M I I I II I I I I I I I I I 
CTGCACCACAGCTCCCCCACCTCTGAGGCTGAGGACTTCGTCTCCCGCCTCTCCACCCAA 1538 

AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 

I II I I II I II II I I I I I I II II I II I I I II I I II I I I I I I I M I I I I I M M I I 

AACTACTTTCGTTCTCTGCCCCGCGGTACCAGCAACATGGCCTATGGGACCTTCAACTTC 1598 

CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 162 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I I I I I I I I I I II III 
CTCGGGGGCCGGCTGATGATCCCTAACACAGGAATCAGCCTCCTCATACCCCCGGACGCC 1658 

ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 1680 

II I I II I I I I I I I I I I I I M I I I I I I I II I I I M I I I II I I II M I I I I I I I I I I 
ATCCCCCGAGGAAAGATCTACGAGATCTACCTCACTCTGCACAAGCCAGAAGACGTGAGG 1718 

TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

I I I I I I I II I I I I II I II I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I II 
TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCTATCGTTAGCTGTGGGCCCCCAGGA 1778 

GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 1800 

I I I I I M I I I I I I I I I I 11 I I I I I I I II I I I I I I I I I I I I I I II I I M I I II Ml 
GTCCTGCTCACCCGGCCAGTCATCCTTGCCATGGACCACTGCGGGGAGCCCAGTCCCGAC 1838 

AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

I II II II I II M I II II II I II II I II II II I II I II II II II I I M I M I II II II 
AGCTGGAGCCTGCGCCTCAAAAAGCAGTCCTGTGAGGGCAGCTGGGAGGACGTGCTGCAC 1898 



I II II I II I II I II II II II II II II I I I I I I II II II II II I II M I I I I 



GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 

I II II M I II II II II II II II I II II II I II I I I II II II II I II II I II II I I II I 
GTCTTTACCGAGCAGCTAGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 2018 

GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

II II I II M M I Ml II I II I II II I II I I M M II II II M II I M I I I II I 

ACCAAGCGCCTCAGGCTCCTTCTGTTTGCCCCTGTGGCCTGTACGTCCCTCGAGTACAAC 2 078 
ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

II M I II I I M I M II I I I II M M II I I M I II II I M I II II I I I II II I II I I 

ATCCGAGTGTACTGCCTGCACGACACCCACGATGCTCTCAAGGAGGTGGTGCAGCTGGAG 2138 



Qy 


2101 


Db 


2139 


Qy 


2161 


Db 


2199 


Qy 


2221 


Db 


2259 


Qy 


2281 


Db 


2319 


Qy 


2341 


Db 


2379 


Qy 


2401 


Db 


2439 


Qy 


2461 


Db 


2499 


Qy 


2521 


Db 


2559 


Qy 


2581 


Db 


2619 


Qy 


2641 


Db 


2679 


Qy 


2701 


Db 


2739 



AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II II I II I I I I II I II I I I I II I M 

AAGCAGCTGGGTGGACAGCTGATCCAGGAGCCCCGTGTCCTGCACTTCAAAGACAGTTAC 2198 
CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

I I I I II II II II I I I I I I I I I I I I I M I I I I I I I I I I I I M I M I I I I I I I M I I 

CACAACCTACGTCTGTCCATCCACGACGTGCCCAGCTCCCTGTGGAAGAGCAAGCTCCTT 2258 

GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

II II II I II I I M I I II I II I I II I M I I I I I I I I II II I I I Mil III I I II I 
GTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAATGGCACTCAGCAGTATCTGCAC 2318 

TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 

M I I I M I 11 I I I I I I I I I I MM lllllll II IMIIIIIIIMIM Mill 

TGCACCTTCACCCTGGAGCGCGTCAATGCCAGCACCAGCGACCTGGCCTGCAAGGTGTGG 237 8 

GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 24 00 

M I II II II II I I II II II II II II II I M I MM lllllll II I I I I 

GTGTGGCAGGTGGAGGGAGATGGACAGAGCTTCAACATCAACTTTAACATCACTAAGGAC 2438 

ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 2460 

II I M II I I I II I I I II I I I II I II II I II I II I II II II M II I II I II I I I II I 
ACGAGGTTTGCTGAAATGCTGGCTCTGGAGAGTGAAGGGGGGGTCCCAGCCCTGGTGGGC 2498 

CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

II I I I II M II II I I I I II I I II II I II II I I I II I I II I Ml II I I I II II I I II I 
CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATTACCAGCCTGGACCCA 2558 

CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAG7\AACTCCACCTGGACAGCCAT 2580 

Mill II II I I I II M I II I II I I II I I II II I II I II I I II I II I II II I II I 
CCCTGCAGCCGGGGCGCCGACTGGCGAACTCTAGCCCAGAAACTTCACCTGGACAGCCAT 2618 

CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

II I M I I I I II II I I I M I I II II I M I I I I I I I I II I I II M II M I I II I I I I II 

CTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTCAACCTATGGGAGGCG 267 8 

CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

I II I I I I II I M M I I II I I II I I I M M II II I I I I II II II I I II M I II I M I 

CGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCCGCAGCTGTGGCCGGACTGGGCCAG 2738 
CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 27 52 

Mill M M II II I II I II MM! I I M I II I II I M II M I I I II 

CCAGATGCTGGCCTCTTCACCGTGTCAGAGGCCGAGTGCTGAGACCAGCCAG 2790 
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BC009333 

LOCUS 

DEFINITION 
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VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 



BC009333 
Homo sapiens 
BC009333 

BC009333.2 GI: 4 0226527 



2688 bp mRNA linear PRI 19-DEC-2003 
cDNA clone IMAGE: 41267 60, partial cds . 



Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 2688) 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



Strausberg^R.L. , Feingold, E . A. , Grouse, L.H., Derge^J.G., 
Klausner,R.D. , Collins , F. S . , Wagner, L., Shenmen, C .M. , Schuler , G. D . , 
Altschul,S.F. , Zeeberg,B., Buetow,K.H., Schaef er, C . F. , Bhat^N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh^F-, 
Diatchenko,L. , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton,M. , Soares,M.B. , Bonaldo,M. F. , Casavant, T . L . , 
Scheetz,T.E. , Brownstein,M. J . , Usdin,T.B., Toshiyuki, S . , 
Carninci,P., Prange,C., Raha,S.S., Loquellano, N . A. , Peters, G. J., 
Abramson,R.D. , Mullahy, S . J. , Bosak,S.A., McEwan,P.J., 
McKernan,K. J. , Malek,J.A., Gunaratne, P . H . , Richards, S., 
Worley,K.C., Hale,S., Garcia,A.M. , Gay,L.J., Hulyk,S.W., 
Villalon,D,K. , Muzny,D.M., Sodergren, E. J. , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman,M., Madan,A., Rodrigues , S . , 
Sanchez, A., Whiting, M., Madan,A., Young,A.C., Shevchenko, Y. , 
Bouffard,G.G. , Blakesley, R. W. , Touchman, J. W. , Green, E.D., 
Dickson,iyi.C. , Rodriguez, A. C . , Grimwood, J . , Schmutz,J., Myers, R.M., 
Butterfield, Y.S. , Krzywinski,M. I . , Skalska,U. , Smailus , D . E . , 
Schnerch,A., Schein,J.E-, Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

2 (bases 1 to 2688) 

Strausberg,R. 

Direct Submission 

Submitted (12-JUN-2001) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

On Dec 19, 2003 this sequence version replaced gi: 14424611. 
Contact: MGC help desk 
Email : cgapbs-r@mail . nih . gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC) , 

Gaithersburg, Maryland; 

Web site: http://v7ww.nisc.nih.gov/ 

Contact : nisc_mgc@nhgri . nih . gov 

Akhter,N., Ayele,K., Beckstrom-Sternberg, S .M. , Benjamin, B., 
Blakesley, R.W. , Bouf f ard, G. G. , Breen,K., Brinkley,C., Brooks, S., 
Dietrich,N.L. , Granite, S., Guan,X., Gupta, J., Haghighi,P., 
Hansen, N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., 
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian, S . D . ,McCloskey , J. C . , 
McDowell, J., Pearson, R., Stantripop, S . , Thomas, P. J., Touchman, J. W. , 
Tsurgeon,C., Vogt,J.L., Walker, M. A., Wetherby, K. D . , Wiggins, L., 
Young, A., Zhang, L.-H. and Green, E.D. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 26 Row: g Column: 22. 

Location/Qualifiers 

1. .2688 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db xref="taxon: 9606" 



/clone="IMAGE: 4126760" 
/tissue_type="Brain, neuroblastoma" 
/ clone_Tib= "NIH_MGC_1 9 " 
/lab__host="DH10B-R" 
/note="Vector: pOTB7" 
CDS <1. .1627 

/codon_start=2 

/product="Unknown (protein for IMAGE : 4126760 ) " 

/protein_id="AAH09333.2" 

/db_xref="GI: 40226528" 

/translation="DVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILT 
SGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPL 
GGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGI 
SLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILA 
MDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGR 
F7U.VGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQ 
LIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 
LERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPS 
AFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 
RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC" 

misc_f eature 413. .724 

/note="ZU5; Region: Domain present in ZO-1 and Unc5-like 

netrin receptors" 

/db_xref="CDD: smart00218" 
misc_feature 1343. .1594 

/note="DEATH; Region: DEATH domain, found in proteins 

involved in cell death (apoptosis)" 

/db_xref="CDD: smartOOOOS" 

ORIGIN 

Query Match 59.4%; Score 1635,4; DB 9; Length 2688; 

Best Local Similarity 99.9%; Pred. No. 1.2e-264; 

Matches 1636; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

GGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCT 1175 

I I I I I I I I M I I 11 I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M M I I 

GGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCT 60 
TGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTC 1235 

I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I M I I I I I I I I I I I M I I I I I I 

TGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTC 12 0 
GTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCC 1295 

I I I I I I I I M I I I M I I I I I M I M I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I 

GTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCC 180 
CCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCT 1355 

M I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I M M M I M I I I I I M I I I 

CCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCT 24 0 

CTGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAG 1415 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
CTGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAG 300 

CCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGA 1475 

I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I M I I I I I I I I I I M I I I I I M I 

CCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGA 360 



Qy 


1116 


Db 


1 


Qy 


1176 


Db 


61 


Qy 


1236 


Db 


121 


Qy 


1296 


Db 


181 


Qy 


1356 


Db 


241 


Qy 


1416 


Db 


301 



Qy 


1476 


Db 


361 


Qy 


1536 


Db 


421 


Qy 


1596 


Db 


481 


Qv 


1656 


Db 


541 


Qy 


1716 


Db 


601 


Qv 


1776 


Db 


661 


Qv 


1836 


Db 


721 


Qy 


1896 


Db 


781 


Ov 


1956 


Db 


841 


Qv 


2016 


Db 


901 


Qy 


2076 


Db 


961 


Qy 


2136 


Db 


1021 


Qy 


2196 


Db 


1081 


Qy 


2256 


Db 


1141 



GTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAA 1535 

I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I M I M I I I 

GTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAA 420 
CATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTAT 1595 

I I M I I I I I I I I I I I I I I I I I I I I M I M I I I I I I M I I M I I I I I I I I M I I I I I I II 

CATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAAT 480 

CAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCAC 1655 

I I I I I I I I I I I I I I I I I I I I I M M I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

CAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCAC 54 0 
GCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCC 1715 

IIIIIIIIIIMIMIIIIIIIMMIIIIIIIIMIMIMIIII Illllllll 

GCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCC 600 

CATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGA 1775 

I I I I I I M I I I I I I I I I I M I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I 

CATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGA 660 

CCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGA 1835 

I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I 

CCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGA 720 

GGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTG 1895 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I M I I I I I I I I 

GGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTG 780 
CCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGT 1955 

MIIIIIIIIMIIIIIIIIIIIIMIIIIIMIMIIMMIIIIIIIMMIIIIIII 

CCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGT 840 
GGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGT 2015 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I M I I I I 

GGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGT 900 
GGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGC 2075 

I M I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I 

GGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGC 960 

ACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACG 2135 

I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I 
ACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACG 102 0 

GGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAG 2195 

I I I M I M I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I 

GGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAG 1080 
CTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTG 2255 

I I I I I I I I I I I I I I M I I I M I I I I I I I I I M I I I M M I I I I I I I I I I I I M I I I M I I 

CTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTG 1140 
GAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCAC 2315 

I I I M I M I I I I M I I I 1 I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I 

GAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCAC 1200 



Qy 


2316 


Db 


1201 


Qy 


2376 


Db 


1261 


Qy 


2436 


Db 


1321 


Qy 


2496 


Db 


1381 


Qy 


2556 


Db 


1441 


Qy 


2616 


Db 


1501 


Qy 


2676 


Db 


1561 


Qy 


2736 


Db 


1621 



TAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAG 2375 

I I I M I I I I I M I I I I I I M I I I MUM M M M M M I M M M I M M M I 

TAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAG 1260 
CATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGA 2435 

I M M M M M M M M M M M M M M M M M M M M M M M M M M M M M I 

CATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGA 1320 
AGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCA 2495 

M M M M M M M M M M M M M M I M I M M I M M M M M M M M M 

AGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCA 1380 
GAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGC 2555 

I I I I I I I I I I M M M M M M M M M M M M M M M M M M M M M M M M M 

GAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGC 1440 

CCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCT^AGCCCAGCCCCACAGC 2615 

I I I I M I I I I M M M M M M M I M M M M M M M M I M 

CCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGC 1500 
CATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGC 2675 

I I M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

CATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGC 1560 

TGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGA 2735 

I I I I I I I I I I I I I M M M M M M M M M M M M M M M M M M M M M M M I 

TGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGA 1620 
GTGCTGAGGCCGGCCAG 2752 

M M M I M M M M M 

GTGCTGAGGCCGGCCAG 1637 
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BD057525 

LOCUS 
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KEYWORDS 
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AUTHORS 
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COMMENT 



BD057525 1787 bp DNA linear PAT 27-AUG-2002 

Netrin receptors. 

BD057525 

BD057525.1 GI: 22603131 

JP 2001505062~A/2. 
synthetic construct 
synthetic construct 
artificial sequences. 
1 (bases 1 to 1787) 

Lavigne.M.T. , Leonardo, D. E . ^ Hinck,L., Masu,M. andMasu,K.K. 
Netrin receptors 

Patent: JP 2001505062-A 2 17-APR-2001; 

THE REGENTS OF THE UNIV OF CALIFORNIA 

PN JP 2001505062-A/2 

PD 17-APR-2001 

PF 19-FEB-1998 JP 1998536840 

PR 19-FEB-1997 US 08/808982 

PI MARC TESSIER LAVIGNE, DAVID E LEONARDO, LINDSAY HINCK,MASAYUKI 
PI MASU, 

PI KAZUKO KEINO MASU 

PC C07Kl/00,C07Kl4/00,C07K17/00,C07H21/02,C07H21/04,G01N33/53 CC 
Strandedness : Double; 



CC Topology: Linear; 

FH Key Location/Qualif iers . 

FEATURES Location/Qualifiers 
source !• .1787 

/organism="synthetic construct" 
/inol_type="genomic DNA" 
/db_xref="taxon: 32630" 

ORIGIN 

Query Match 56.8%; Score 1562.4; DB 6; Length 1787; 

Best Local Similarity 98.5%; Pred. No. 2.3e-252; 

Matches 1661; Conservative 0; Mismatches 16; Indels 9; Gaps 8; 

Qy 107 0 GCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCT 1129 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I 
Db 1 GCAACTGTACCAGTGACCTCTG-GTACACACTGCTTCTGGCCCTGAGGACGTGGCCCTCT 59 

Qy 1130 ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 1189 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 

Db 60 ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 119 

Qy 1190 TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 124 9 

I I I M I I I I I I I I I I I I I M I I I I I I I I I I M M I I I I I M M I M I I I I N I I I I I N I 

Db 120 TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 179 

Qy 1250 CAGGCTTCCAGCCCGTCAGCATC-AAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 1308 

I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 CAGGCTTCCAGCCCGTCAGCATCTAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 239 

Qy 1309 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 1368 

M M I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I M I I I I I I I I I I I I I I M I 

Db 240 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 299 

Qy 1369 GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 1428 

I I I M M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I 

Db 300 GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 359 

Qy 1429 GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 1488 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 360 GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 419 

Qy 1489 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 1548 

M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 420 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 479 

Qy 1549 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATC 1608 

I I I I I I M I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 48 0 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATC 539 

Qy 1609 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 1668 

MIMIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIII 

Db 540 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 599 

Qy 1669 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 1728 

I I I I M I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I M I 

Db 600 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 659 



Qy 


1729 


Db 


660 


Qy 


1789 


Db 


720 


Qy 


1849 


Db 


779 


Qy 


1909 


Db 


838 


Qy 


1969 


Db 


898 


Qy 


2029 


Db 


958 


Qy 


2089 


Db 


1018 


Qy 


2149 


Db 


1077 


Qy 


2209 


Db 


1135 


Qy 


2269 


Db 


1195 


Qy 


2329 


Db 


1255 


Qy 


2389 


Db 


1315 


Qy 


2449 


Db 


1375 


Qy 


2509 


Db 


1435 


Qy 


2569 



GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I M I I I I I I M I I I I I M I I I I M I 

GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 



1788 
719 



CCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAG 1848 

M I I I I I I M I I I I I I I I I M I I I I I 1 I I I I I I I I I I I I I I I I M I I I I I I I I I I M 

CCCAGCCCTGACAGCTGGAGCCTGGCCCTCAAAAAGCAGTCGTGCGAGGG-AGCTGGGAG 778 

GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 1908 

Mill I II I I I I II I I I I I I I I I I M I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I 

GATGT-CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 837 
AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 1968 

I I I I I I I I I I I II I II II I I I I I I I I I I I I I M M I II II I I I I II I I I I I I I I I I M I I 

AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 897 

AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 2 028 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I II II M I I II II 

AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 957 

CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 2088 

M I I I I I I I II I I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I I I I I I I II I I I I I I 
CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 1017 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 214 8 

I M I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M II I II I I I II 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTT- 107 6 

AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 2208 

I I I I I I II I I M I I I I I I I I III I I I I I I I I I I I I I I I I I I II I I I I I I I II I I 

TVAGGACAGTTACCACAACCT— GCCCTATCATCCACGATGTGCCCAGCTCCCTGTGGAAG 1134 

AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 2268 

IIIIIIIIMIIIIIMIIIIIIIIMMIIIIIIIIMIIIIIIIIIIIIIIIMIIM 

AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 1194 
CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 2328 

I I I I I I M I II I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I 

CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 1254 



TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 2 38 8 

I II II I II I I I I I I I I I I I I .1 II I I I I I M I I I I I I I I I M I I I I I I I II I I I II 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 



1314 



MIIIIMIMIIIIMIIIIIIIIIMIIIIMIIIMIIIIIIIIIIIIMIIIIIII 

ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 1374 

GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 2508 

I I I I I I I I I II I I I I I I I I I I II I II I M I I I I I I I I I I I I I I I I I I I M I I M I I I I I I 
GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 1434 

AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 2568 

I I I I I I I I I II I I I II I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 1494 

CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 262 8 



Db 


1495 


Qy 


2629 


Db 


1555 


Qy 


2689 


Db 


1615 


Qy 


2747 


Db 


1675 



IIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIMIIMIIIMMIM Mill 

CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 1554 
CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 268 8 

I I I I M I I I I M I M I I I I I M I I I I II I I I M M M I I I I I I I I I I I I M I I II II I II 

CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 1614 
GGACTGGGCCAGCCAGACGCTGGCCTC-TTCACAGTG-TCGGAGGCTGAGTGCTGAGGCC 274 6 

M I I I I I M I I II I I I I M II M I I I I I I I I 1 I M I I I II I I I I I I II I I 

GGGACTGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCC 1674 



MINI 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



linear PAT 13-JAN-2001 



AX054976 9700 bp DNA 

Sequence 91 from Patent WO0073328. 

AX054976 

AX054 976.1 01:12228344 

synthetic construct 
synthetic construct 
artificial sequences. 

1 

van Criekinge,W., Roelens.I., Bogaert,T. and Verwaerde, P . 
Unc-5 constructs and screening methods 
Patent: WO 0073328-A 91 07-DEC-2000; 
Devgen NV (BE) 

Location/Qualifiers 

1. .9700 

/organism=" synthetic construct" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 32630" 
/note="plasmid pGCl037" 



Query Match 47 . 4%; 

Best Local Similarity 99.8%; 
Matches 1306; Conservative 



Score 1304.2; DB 6; 
Pred. No. 4e-209; 
0; Mismatches 3; 



Length 9700; 
Indels 0; Gaps 



Qy 


1437 


Db 


7 


Qy 


1497 


Db 


67 


Qy 


1557 


Db 


127 



CACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCAC 

I I I I I I I I I M M I I I I I I I I I II M II I I I I I M I I II I II M I I I I I I I I II II M II 

CACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCAC 
CCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAA 

I I I I I I I I I I I II M I I I 1 I I M II I I M I I I II I I I M I I II M I I I I I I I I I I I I II I 

CCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAA 
CTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGA 

I I I I M I I I M II I I I II I I I I II M I I M M I II I II I M I I II I I II I I II II I I M 

CTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATCCCCCCAGA 



0; 



1496 



66 



1556 



126 



1616 



186 



1617 TGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGT 1676 
I I I I I I I I M I II I I I II II 1 I I I I I I I I I II II I I I I I I I II I I II I I I M I I I I I I I I 



Db 


187 


TGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGT^AGACGT 


246 


Qy 


1677 


GAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCC 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCC 


1736 


Db 


247 


306 


Qy 


1737 


TGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCC 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCC 


1796 


Db 


307 


366 


Qy 

Db 


1797 
367 


TGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCT 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 

TGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCT 


1856 
426 


Qy 


1857 


GCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTG 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

GCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTG 


1916 


Db 


427 


486 


Qy 


1917 


CTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGC 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGC 


1976 


Db 


487 


546 


Qy 


1977 


TGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 M M 

TGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTA 


2036 


Db 


547 


606 


Qy 

Db 


2037 
607 


CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 
CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCT 


2096 
666 


Qy 


2097 


GGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAG 

1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 

GGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAG 


2156 


Db 


667 


726 


Qy 


2157 


TTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCT 

M 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 11 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCT 


2216 


Db 


727 


786 


Qy 


2217 


CCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTT 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

CCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTT 


2276 


Db 


787 


846 


Qy 


2277 


GCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCT 

1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

GCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCT 


2336 


Db 


847 


906 


Qy 


2337 


GTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAA 

1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 M 1 1 M 1 1 M 1 1 1 

GTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCTUV 


2396 


Db 


907 


966 


Qy 


2397 


GGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGT 

1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M 11 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 > 1 1 

GGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGTUs^GCGGGGGTCCAAGCCCTGGT 


2456 


Db 


967 


1026 


Qy 


2457 


GGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGA 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

GGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGA 


2516 


Db 


1027 


1086 



Qy 2517 CCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAG 2576 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1087 CCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAG 1146 

Qy 2577 CCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGA 2636 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I 
Db 1147 CCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGA 12 06 

Qy 2637 GGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGG 2696 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M 

Db 12 07 GGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGG 1266 

Qy 2697 CCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

I I I I I I M I I I I I I I I I M I I M M I I I I I I I I I I M I I I I 

Db 1267 CCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGTC 1315 



RESULT 13 

MMU72634 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



MMU72634 9299 bp mRNA linear ROD 13-MAY-1997 

Mus musculus rostral cerebellar malformation protein (rem) mRNA, 
complete cds . 

U72634 

U72634.1 GI:2088526 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

1 (bases 1 to 9299) 

Ackerman.S.L., Kozak,L.P., Przyborski, S . A. , Rund,L.A., Boyer,B.B. 
and Knowles^B.B, 

The mouse rostral cerebellar malformation gene encodes an 

UNC-5-like protein 

Nature 386 (6627), 838-842 (1997) 

97271898 

9126743 

2 (bases 1 to 9299) 

Ackerman, S.L. , Kozak,L.P., Rund^L.A. and Knowles,B.B. 

Direct Submission 

Submitted (25-SEP-1996) The Jackson Laboratory, 600 Main Street, 
Bar Harbor, ME 04609, USA 

Location/ Qualifiers 
1. .9299 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57B6/SJL" 
/db_xref="taxon: 10090" 
/chromosome="3" 
/map="between Amyl and Mpmv9" 
1. .9299 
/gene="rcm" 
151. .2946 
/gene="rcm" 

/function=" transmembrane receptor" 
/note="similar to C. elegans UNC-5" 



/codon_start=l 

/product="rostral cerebellar malformation protein" 
/protein__id="AAB54103 . 1" 
/db_xref="GI: 2088527" 

/translation="MRKGLRATAARCGLGLGYLLQMLVLPALALLSASGTGSAAQDDE 
FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQIYFKCNSEWVHQ 
KDHWDERVDETSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYV 
RIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPAEDRNFYI 
TIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRC 
GRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTH 
WRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQAAPDSDDVALYVGIVIAVTVCLA 
ITVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAARQDLLAVPPDLTSAAAMYR 
GPVYALHDVSDKIPMTNSPILDPLPNLKIKVYNSSGAVTPQDDLAEFSSKLSPQMTQS 
LLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRVYE 
MYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVILTLHHCADPSTEDWKIQL 
KNQAVQGQWEDWWGEENFTTPCYIQLDAEACHILTENLSTYALVGQSTTKAAAKRL 
KLAIFGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGSIHN 
LRLSIHDIAHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLC 
VRQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSL 
DAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLE 
EMGRHETWSLAAEGQY" 

ORIGIN 



Query Match 36.0%; Score 992; DB 10; Length 9299; 

Best Local Similarity 62.7%; Pred. No. le-156; 

Matches 1660; Conservative 0; Mismatches 925; Indels 63; Gaps 



Qy 


149 


TGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACA 

IN III II 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

TTCCATCTGACCCACCTGAGCCATTGCCACACTTCCTCATTGAGCCCGAGGAAGCTTACA 


208 


Db 


308 


367 


Qy 


209 


TCGTCT^GAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCT 

1 II II II II 1 1 1 1 1 1 M M II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 
TTGTGAAGAACAAGCCTGTGAACCTGTATTGTAAAGCCAGCCCTGCCACCCAGATCTACT 


268 


Db 


368 


427 


Qy 


269 


TCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACG 

1 II 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 III 1 1 1 1 1 1 1 II III II 1 1 1 1 
TCAAGTGCAACAGCGAGTGGGTTCATCAGAAGGACCACGTAGTAGACGAGAGAGTAGATG 


328 


Db 


428 


487 


Qy 


329 


GGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGA 

II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 
AAACCTCTGGTCTAATTGTGAGAGAAGTGAGCATTGAGATTTCACGCCAGCAGGTGGAGG 


388 


Db 


488 


547 


Qy 


389 


AGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCA 

1 1 1 II 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 

AACTGTTTGGGCCTGAAGATTACTGGTGCCAGTGTGTGGCCTGGAGCTCAGCAGGCACTA 


448 


Db 


548 


607 


Qy 


449 


CCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGC 

1 1 1 II 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M II 1 
CGAAGAGTCGGAAGGCATACGTGCGCATTGCGTATCTGCGGAAGACATTCGAGCAGGAAC 


508 


Db 


608 


667 


Qy 


509 


CGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGG 

1 III 1 II II 1 1 M II M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

CCTTGGGAAAGGAAGTGTCCTTGGAGCAGGAAGTCTTACTCCAGTGTCGGCCACCTGAAG 


568 


Db 


668 


727 


Qy 


569 


GCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGG 


628 



I I I M I I I I I I I i I I I I I I I I I I I Mill I 



Db 


728 


GGATCCCAGTGGCTGAGGTGGAATGGCTAAAGAATGAAGACATAATTGATCCTGCTGAAG 


787 


Qy 


629 


ACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTG 

II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ATCGGAACTTTTATATTACTATCGATCACAACCTGATCATCAAGCAAGCCCGACTCTCAG 


688 


Db 


788 


847 


Qy 


689 


ACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCG 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

ATACAGCAAATTATACCTGTGTTGCCAAAAATATTGTTGCCAAGAGAAAAAGCACCACAG 


748 


Db 


848 


907 


Qy 


749 


CTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCA 

1 1 1 1 1 1 1 1 1 M II II II 1 1 II 1 1 1 1 1 1 II 1 1 1 II II 1 II II 1 M M 1 
CCACTGTCATCGTGTATGTTAATGGTGGCTGGTCCACCTGGACAGAGTGGTCTGTGTGTA 


808 


Db 


908 


967 


Qy 


809 


GCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTC 

1 1 II 1 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

ACAGCCGCTGTGGGCGAGGATATCAGAAACGCACAAGAACCTGCACCAACCCAGCCCCAC 


868 


Db 


968 


1027 


Qy 


869 


TCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGT 

INI II II II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 III 1 1 1 1 1 1 1 III III 1 II 1 1 
TCAATGGTGGGGCCTTCTGTGAGGGGCAGAGTGTGCAGAAAATAGCATGCACTACGTTAT 


928 


Db 


1028 


1087 


Qy 


929 


GCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCA 

1 II II 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II M 

GTCCAGTGGATGGTAGGTGGACTTCATGGAGCAAATGGTCAACCTGTGGGACTGAATGCA 


988 


Db 


1088 


1147 


Qy 


989 


CCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCC 

1 1 1 1 1 II 1 M M 1 1 1 1 1 1 II M 1 M 1 1 1 1 1 1 II 1 1 1 1 M 1 1 

CCCACTGGCGCAGGAGGGAGTGTACAGCACCAGCCCCCAAGAACGGGGGTAAGGACTGTG 


1048 


Db 


1148 


1207 


Qy 


1049 


AGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTG 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ATGGCCTGGTCCTCCAATCCAAGAACTGCACTGATGGGCTGTGCATGCAGGCTGCTCCTG 


1108 


Db 


1208 


1267 


Qy 


1109 


GCCCTGAGGACGTGGCCCTCTATGTGGG CCTCATCGCCGTGGCCGTCTGCCTGGTCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 
ACTCAGATGATGTGGCTCTCTACGTGGGGATTGTGATCGCTGTAACAGTCTGTCTGGCGA 


1165 


Db 


1268 


1327 


Qy 


1166 


TGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATG 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 M 

TCACTGTTGTGGTGGCCCTGTTTGTGTATCGGAAGAACCACCGTGACTTTGAGTCTGACA 


1225 


Db 


1328 


1387 


Qy 


1226 


TGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAG 


1285 


Db 


1388 


1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TCATTGACTCCTCAGCACTCAATGGCGGCTTTCAGCCTGTGAACATCAAG - G 


1438 


Qy 


1286 


CAGACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACC 

1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 III III 
CTGCCAGACAAGATCTCCTGGCTGTCCCCCCTGACCTCACCTCAGCTGCAGCCATGTACA 


1345 


Db 


1439 


1498 


Qy 


1346 


AGGGCAGTCTCTGTCCCCGGCAGGATG GGCCCAGCCCCAAGTTCCAGCTCACCAA 

III 1 1 II 1 1 1 1 1 1 II M II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 
III 1 III 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 III ) 1 1 1 1 1 1 

GGGGACCTGTCTATGCTCTGCATGATGTCTCAGACAAAATCCCAATGACCAACTCTCCAA 


1400 


Db 


1499 


1558 


Qy 


1401 


TGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCAC 

1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TTCTGGACCCACTACCCAACTTGAAAATCAAAGTGTACAACAGCTCAGGTGCTGTCACTC 


1460 


Db 


1559 


1618 



Qy 


1461 


III III 1 1 1 II 1 1 III 1 1 II III 

CTCAGGATGACCTTGCCGAGTTCTCATCCAAACTGTCACCCCAGATGACCCAGTCCTTGC 


1498 


Db 


1619 


1678 


Qy 


1499 


1 1 1 1 1 1 1 1 1 1 1 III 1 
TAGAGAATGAGGCCCTTAACCTGAAGAACCAGAGCCTCGCAAGACAGACTGACCCATCCT 


1534 


Db 


1679 


1738 


Qy 


1535 


ACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTA 

II 1 1 III 1 1 1 1 II M 11 1 1 1 1 1 II 1 II M 1 1 1 1 1 1 1 1 1 1 M 

GCACAGCATTTGGTACCTTCAACTCTCTTGGGGGTCACCTCATCATTCCTAATTCAGGAG 


1594 


Db 


1739 


1798 


Qy 


1595 


TCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCA 

1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 II 
TAAGCTTGCTGATTCCCGCTGGGGCCATTCCTCAGGGGAGAGTCTATGAAATGTATGTGA 


1654 


Db 


1799 


1858 


Qy 


1655 


CGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTC 

1 1 1 II 1 1 II 1 1 Mill 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 
CTGTACACAGGAAAGAAAATATGAGGCCCCCCATGGAAGACTCTCAGACCCTACTTACCC 


1714 


Db 


1859 


1918 


Qy 


1715 


CCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGG 

1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 MM 

CTGTGGTGAGCTGTGGGCCTCCTGGAGCTCTGCTGACCCGCCCTGTCATCCTCACTCTGC 


1774 


Db 


1919 


1978 


Qy 


1775 


ACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCG 

1 1 II 1 II 1 M II 1 1 1 1 1 1 1 Mill II M M 1 II 1 1 II 1 
ATCACTGTGCAGACCCCAGCACCGAGGACTGGAAGATCCAGCTCAAAAACCAGGCAGTGC 


1834 


Db 


1979 


2038 


Qy 


1835 


AGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACT 

II II 1 1 II 1 1 1 1 II 1 II 1 M II 1 1 1 1 1 1 II II III 
AGGGACAATGGGAGGATGTTGTGGTGGTTGGGGAGGAGAACTTCACAACCCCCTGTTACA 


1894 


Db 


2039 


2098 


Qy 


1895 


GCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGG 

II 1 M II 1 II II II 1 1 1 1 1 II 1 II 1 1 M 1 II II 1 1 II M 
TTCAGCTGGATGCAGAGGCTTGCCATATCCTCACAGAGAACCTCAGTACCTATGCCCTGG 


1954 


Db 


2099 


2158 


Qy 


1955 


TGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGG 

III II 1 1 III II II II II 1 1 11 II 1 1 1 II 1 II 1 1 11 1 
TTGGGCAGTCCACCACCAAAGCAGCTGCCAAGCGTCTTAAACTGGCCATCTTTGGGCCCC 


2014 


Db 


2159 


2218 


Qy 


2015 


TGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATG 

1 1 1 M 1 M 1 M II 1 II M II 1 1 1 1 II II II 1 1 II II II M 1 1 1 II M 

TCTGCTGCTCTTCCCTGGAGTACAGCATTAGAGTCTACTGCCTGGATGACACACAGGATG 


2074 


Db 


2219 


2278 


Qy 


2075 


CACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCAC 

1 M 1 II M 1 1 1 II 1 II II M II 1 II II 1 M 1 M II 1 Mill 

CCCTGAAGGAAGTTCTACAACTGGAGAGGCAAATGGGAGGACAGCTCCTAGAAGAACCCA 


2134 


Db 


2279 


2338 


yy 


^ ± o -J 


GGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCA 

II II 1 1 II 1 1 1 1 II 1 1 II II 1 M II 1 1 1 1 1 1 M II II 1 II 

AGGCTCTTCATTTTAAAGGCAGCATCCACAACCTGCGCCTGTCTATTCATGACATCGCCC 


2194 


Db 


2339 


2398 


Qy 


2195 


GCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCT 

II 1 M M 1 M 1 1 1 II Mil 1 II M II 1 II II II II 1 II 1 II II 

ATTCCCTCTGGAAGAGCA/^TTGCTGGCTAAGTATCAGGATVATTCCATTTTACCACATCT 


2254 


Db 


2399 


2458 



QV 2255 GGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCA 2314 

I I I I I I I I I I I I I I I I I I M I M M I I I I I I I I I MINI III 
Db 2459 GGAGTGGCTCTCAAAGAAACCTCCACTGCACCTTCACTCTGGAAAGACTCAGCCTAAACA 



2518 
2374 



Ov 2315 CTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCA 

I I I II II M II I II II II I I II II II I I I I I I I I I II II II I I 

Db 2519 CAGTGGAACTGGTTTGCAAACTCTGTGTGCGGCAGGTTGAAGGAGAAGGGCAGATCTTCC 2578 

Ov 2375 GCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTG 2434 

I I I I M I I I II II I I MM II I II 11 

Db 2579 AGCTCAACTGTACTGTGTCAGAGGAACCTACTGGCATCGACTTACCTCTCCTGGACCCTG 2638 

Ov 2435 AAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGC 2494 

I I I I I I I M I I I I I M I I I I I I III 1 I I I II 

Db 2639 CTAGTACCATCACCACTGTCACCGGACCAAGTGCTTTCAGCATTCCTCTCCCTATCCGGC 2698 

Ov 2495 AGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGG 2554 

Mill II II II MM I I II I III Mill 

Db 2699 AGAAGCTATGCAGCAGCCTGGATGCCCCTCAAACAAGAGGCCATGACTGGAGGATGCTGG 2758 

Qv 2555 CCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAG 2614 

MM IIIIM IIMMIIM I II I IIIMM MM I Mill II I 

Db 2759 CCCATAAACTCAACCTGGACAGGTACTTGAATTACTTTGCCACCAAATCGAGCCCAACTG 2818 

Ov 2615 CCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGG 2 674 

I I M I II II M II I II I I I II I Mill 

Db 2819 GCGTAATCCTGGATCTTTGGGAAGCACAGAACTTCCCAGATGGAAACCTGAGCATGCTGG 2878 

Qy 2675 CTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTG 2734 

I II I Ml I I MM I M I I III I I II I 

Db 2879 CAGCCGTCCTGGAAGAAATGGGAAGACATGAGACAGTGGTGTCCTTGGCAGCAGAAGGAC 2938 

Qy 2735 AGTGCTGA 2742 

III Ml 
Db 2939 AGTATTGA 2946 
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DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 



AY187310 2962 bp mRNA linear VRT 06-JUN-2003 

Gallus gallus UNC5-like protein 3 mRNA, complete cds . 

AY187310 

AY187310. 1 GI: 31442350 

Gallus gallus (chicken) 

Gallus gallus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Archosauria; Aves; Neognathae; Galliformes; Phasianidae; 
Phasianinae; Gallus . 

1 (bases 1 to 2962) 
Guan^W. and Condic,M.L. 

Characterization of Netrin-1, Neogenin and cUNC-5H3 expression 
during chick dorsal root ganglia development 
Gene Expr. Patterns 3, 369-373 (2003) 

2 (bases 1 to 2962) 
Guan,W. and Condic,M.L. 
Direct Submission 



JOURNAL 

FEATURES 

source 



CDS 



Submitted (2 6-NOV-2 0 02 ) Neurobiology 
20 North, 1900 East, Salt Lake City, 

Location/Qualifiers 

1. .2962 

/organism="Gallus gallus" 
/mol_type="mRNA" 
/db_xref="taxon: 9031" 
1. .2796 
/codon_start=l 
/product="UNC5-like protein 
/protein_id="AA067275-l" 
/db xref="GI: 31442351 



& Anatomy, University of Utah, 
UT 84132-3401, USA 



3" 



/translation="MGKGLEGTAARCGLGMGYLLHSWLPALAVLGASRPGSAAQDDD 
FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQIYFKCNSEWVHQ 
KDHWDERVDETSGLIVCEVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYV 
RIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEEVIDPVEDRNFYI 
TIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSACNSRC 
GRGFQKRTRTCTNPAPLNGGAFCEGQNVQKIACTTLCPVDGKWTSWSKWSTCGTECTH 
WRRRECTAPAPKNGGKDCEGLVLQSKNCTDGLCMQAAPDSDDVALYVGIVIAVIVCLA 
ISVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAARQDLLAVPPDLTSAAAMYR 
GPVYALHDVSDKIPMTNSPILDPLPNLKIKVYNTSGAVTPQDELSDFSSKLSPQITQS 
LLENETLNVKNQSLARQTDPSCTAFGTFNSLGGHLVIPNSGVSLLIPAGAVPQGRVYE 
MYVTVHRKEGMRPPVEDSQTLLTPWSCGPPGALLTRPWLTMHHCAEPNMDDWQIQL 
KHQAGQGPWEDWWGEENFTTPCYIQLDPEACHILTETLSTYALVGQSITKAAAKRL 
KLAIFGPLSCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKTLHFKGSTHN 
LRLSIHDIAHSLWKSKLPAKYQEIPFYHIWSGCQRNLHCTFTLERFSLNTLELVCKLC 
VRQVEGEGQIFQLNCSVSEEPTGIDYPIMDSAGSITTIVGPNAFSIPLPIRQKLCSSL 
DAPQTRGHDWRMLAHKLKLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLE 

EMGRHET WS LAAEGN Y " 



ORIGIN 



Query Match 36.0%; 
Best Local Similarity 62.6%; 
Matches 1662; Conservative 



Score 991; DB 5; Length 2962; 
Pred. No. 1.7e-156; 
0; Mismatches 930; Indels 63; 



Gaps 



Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



149 TGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACA 

I I I I M I I I I I III Ill I II II III II I I I I I 

158 TTCCTTCTGATCCTCCAGAGCCATTGCCCCACTTTCTCATTGAACCCGAAGAAGCTTACA 

2 09 TCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCT 

M II II I I I II II I III 11 I M I I I I I I I I I I I I I I I I I I I I 

218 TCGTGAAAAACT^GCCTGTGAATCTGTACTGCAAAGCGAGCCCTGCCACGCAGATCTATT 

269 TCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACG 

I M I II II I I I I I I II I I I III I I I II I I I I I I II I I I I 

278 TTAAGTGCAACAGTGAATGGGTTCATCAGAAGGATCATGTGGTGGATGAGAGAGTAGATG 

329 GGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGA 

II I M I M I I Mill MM I III I II II I II I I I 

338 AAACCTCTGGTCTGATCGTCTGTGAGGTGAGCATCGAGATTTCCCGCCAGCAGGTGGAAG 
389 AGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCA 

II I II M I M I M II II II II I I I I I I I I M I I II I I I I I I I I M I I 

398 AGCTCTTTGGACCCGAGGACTACTGGTGCCAGTGTGTCGCCTGGAGCTCAGCTGGCACCA 
449 CCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGC 

I I I II II I I M I I I I I I I I M I I I M I I I I II M I I I I I M II I 



5; 



208 



217 



268 



277 



328 



337 



388 



397 



448 



457 



508 



Db 


458 


CCT^GAGCCGCAAGGCCTACGTCCGCATTGCATATCTCAGAAAGACTTTTGAGCAGGAGC 


517 


Qy 


509 


CGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGG 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGCTGGGGAAAGAAGTGTCCCTGGAGCAAGAGGTCCTGCTCCAGTGCCGTCCTCCTGAAG 


568 


Db 


518 


577 


Qy 


569 


GCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGG 


628 


Db 


578 


1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

GCATTCCAGTAGCTGAGGTAGAGTGGCTG7\AGAATGAAGAGGTGATCGATCCTGTGGAAG 


637 


Qy 


629 


ACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTG 

III 1 II 1 1 1 1 1 M 1 1 II M M 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 
ACCGAAATTTTTACATCACCATTGATCACAACCTGATCATCAAGCAAGCCCGGCTTTCCG 


688 


Db 


638 


697 


Qy 


689 


ACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCG 

1 1 1 M II 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 

ACACGGCTAACTACACCTGTGTTGCCAAAAACATTGTGGCCAAAAGGAAAAGCACGACAG 


748 


Db 


698 


757 


Qy 


749 


CTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCA 

1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 11 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 II II 
CAACTGTGATTGTCTATGTGAATGGAGGCTGGTCTACCTGGACCGAGTGGTCAGCGTGCA 


808 


Db 


758 


817 


Qy 


809 


GCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTC 

1 1 1 1 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 
ACAGCCGCTGTGGGAGAGGCTTCCAGAAGCGCACAAGGACCTGCACTAACCCTGCCCCAC 


868 


Db 


818 


877 


Qy 


869 


TCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGT 

MM Mill II Mill llllllll Mill MIMII III Ml MIMIMI 

TCAATGGGGGGGCCTTCTGCGAGGGGCAAAATGTTCAGAAAATAGCTTGCACCACCCTGT 


928 


Db 


878 


937 


Qy 


929 


GCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCA 

1 M 1 II 1 1 M II MM 1 II II II 1 M M II 1 1 II M 1 II 1 1 1 

GTCCAGTGGATGGCAAATGGACGTCCTGGAGCAAGTGGTCCACTTGTGGCACAGAGTGTA 


988 


Db 


938 


997 


Qy 


989 


CCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCC 

II II M M II 1 1 M II M 1 1 MUM II II II 1 MUM 

CCCACTGGCGCCGGAGGGAGTGCACAGCTCCGGCCCCGAAGAATGGAGGCAAGGACTGTG 


1048 


Db 


998 


1057 


Qy 


1049 


AGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTG 

1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 M Mill III III III 

AGGGACTGGTGCTGCAGTCTAAGAACTGCACTGATGGGCTCTGCATGCAGGCTGCACCTG 


1108 


Db 


1058 


1117 


Qy 


1109 


GCCCTGAGGACGTGGCCCTCTATGTGGG CCTCATCGCCGTGGCCGTCTGCCTGGTCC 

1 1 M II II M II 1 M II II 1 II 1 1 II II 1 II II II II 1 

ACTCGGATGATGTTGCTCTCTACGTGGGGATTGTCATTGCTGTGATTGTGTGCCTGGCTA 


1165 


Db 


1118 


1177 


Qy 


1166 


TGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATG 

1 MUM II 1 II M 1 1 1 III 1 1 1 II II 1 1 II 

TTTCTGTGGTTGTGGCCCTGTTTGTCTATCGCAAGAACCACCGTGACTTTGAGTCAGATA 


1225 


Db 


1178 


1237 


Qy 


1226 


TGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAG 

1 t 1 1 1 I 1 i Ml M II II II 1 II 1 II II 1 M 1 II 
1 i 1 1 1 1 11 II 1 ' ' ' ' 1 1 1 1 1 II 1 1 1 1 1 1 1 1 i ' ■ 

TTATCGACTCATCGGCGCTAAATGGGGGATTTCAGCCTGTTAACATCAAGGCTGCAAGAC 


1285 


Db 


1238 


1297 


Qy 


1286 


CAGACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACC 

II II II 1 1 II MIMIMI 1 1 1 Ml III 

AAGAC CTCTTGGCAGTGCCACCAGACCTCACTTCTGCTGCAGCCATGTACA 


1345 


Db 


1298 


1348 



Qy 1346 AGGGCAGTCTCTGTCCCCGGCAGGATG GGCCCAGCCCCAAGTTCCAGCTCACCAA 1400 

III I I I I II III MM I I MMI III M II I 

Db 1349 GGGGGCCTGTGTATGCCTTGCATGATGTCTCTGATAAAATCCCAATGACCAATTCTCCGA 1408 

Qy 1401 T GGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTC 1456 

I I I I M I I I I 11 I MM III 

Db 1409 TCCTGGACCCACTGCCCAATCTGAAGATTAAAGTTTATAACACCTCTGGAGCAGTCACCC 1468 

Qy 1457 CCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCC 1498 

II III I MMI MM II I M I I 

Db 14 69 CCCAGGATGAACTCTCTGACTTCTCCTCCAAGCTGTCCCCACAGATTACCCAGTCTCTGT 1528 
Qy 1499 AGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCA 1534 

M MM MM I 

Db 1529 TGGAGAATGAGACTCTGAACGTGAAGAACCAAAGCCTTGCACGGCAAACAGACCCATCCT 1588 

Qy 1535 ACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTA 1594 

M I M II M M M M I II M I II I M M M II I M II M I I 

Db 1589 GCACTGCATTTGGGACCTTCAACTCGTTAGGGGGCCACCTAGTAATTCCTAATTCAGGAG 1648 

Qy 1595 TCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCA 1654 

I MM II Mill Ml II Mill MM lllllll II II II 

Db 1649 TGAGCTTGCTGATCCCAGCAGGGGCTGTTCCCCAAGGAAGAGTCTATGAAATGTATGTGA 1708 

Qy 1655 CGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTC 1714 

I llllll Ml M MM II III III Mill lllllll I 

Db 17 09 CAGT CCACAGG7VAGGAGGGCATGAGACCACCTGTAGAAGACAGCCAGACGCTGCTGACAC 1768 

Qy 1715 CCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGG 1774 

I Ml MMM MM llllll Mill III 

Db 1769 CAGTGGTGAGCTGTGGCCCACCAGGAGCGCTGCTGACCCGACCCGTTGTGCTGACCATGC 1828 

Qy 1775 ACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCG 1834 

M M II III II II II II II I I II M I II I I Mill II 

Db 1829 ACCACTGTGCTGAGCCCAACATGGATGACTGGCAGATCCAGCTCAAGCACCAGGCAGGCC 1888 

Qy 1835 AGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACT 1894 

II II II II II 11 M I II II II II I I II I I Mill 

Db 1889 AGGGACCATGGGAGGATGTAGTGGTGGTCGGGGAGGAAAACTTCACCACTCCATGCTACA 1948 

Qy 1895 GCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGG 1954 

M II I Ml I I II I II I I II II I M M II I llllll 

Db 1949 TCCAGCTGGACCCAGAGGCCTGTCATATCCTGACGGAGACCCTCAGCACGTACGCCTTGG 2008 

Qy 1955 TGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGG 2 014 

MMM MMM II I M I I II II I M II II Mill II 
Db 2009 TGGGACAATCCATCACCAAAGCAGCAGCCAAACGTCTCAAATTGGCCATCTTTGGACCAC 2068 

Qy 2015 TGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATG 2074 

llllll II II II I II M II I M M II II I II III I I II II I II I II M I 

Db 2069 TGTCCTGTTCCTCACTGGAGTACAGCATCCGCGTCTACTGCCTCGATGACACACAGGATG 2128 

Qy 2075 CACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCAC 2134 

II II I II I I II I M II II I I M M I II II II II II M llllll 
Db 2129 CCCTGAAGGAGGTCCTCCAGCTTGAGCGGCAGATGGGTGGGCAGCTGTTGGAGGAACCCA 2188 



Qy 2135 GGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCA 2194 

I M I I I M I I III I M I M I I I I I I I I I M I I I M I I I II 

Db 2189 AAACTTTGCATTTTAAGGGAAGTACCCACAACCTGCGCTTATCCATTCATGACATTGCCC 224 8 

Qy 2195 GCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCT 2254 

III II II I II II I II II I I I I I I I I I I I I I I 1! I I I I I I M I I II 
Db 2249 ACTCTCTCTGGAAGAGCAAACTGCCGGCTAAATACCAGGAGATTCCTTTTTACCACATCT 2308 

Qy 2255 GGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCA 2314 

II I I I I II I II II II I II I I I II II II I I I I I I I I I I I I I I I I I I 

Db 2309 GGAGTGGGTGCCAGAGGAACTTGCACTGCACCTTCACGCTGGAACGATTCAGTCTCAATA 2368 

Qy 2315 CTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCA 2374 

I II II I I I M M II II Ml I I I I II II II II I I II I I I I II I 

Db 2369 CCCTGGAGCTCGTCTGCA7VACTCTGTGTGCGGCAAGTCGAAGGAGAAGGGCAGATCTTCC 2428 

Qy 2375 GCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTG 2434 

I II II I I I I I II I I Ml I I I I I I 

Db 2429 AGCTGi\ACTGCTCAGTATCAGAGGAACCCACTGGCATTGATTATCCCATCATGGATTCAG 248 8 

Qy 2435 AAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGC 2494 

II I I I I I I I I I II I I I I I I I II I I I I I II I II II III 
Db 2489 CAGGCAGCATCACAACGATAGTTGGGCCCAACGCTTTCAGCATCCCCCTCCCAATAAGGC 2548 

Qy 2495 AGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGG 2554 

I M I I I I I I I II I I I I I I I I I I I I I I I I II I I II 

Db 2549 AGAAGCTCTGCAGCAGCCTGGATGCACCCCAGACCCGGGGCCATGACTGGAGGATGCTGG 2608 

Qy 2555 CCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAG 2614 

I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I I I I 
Db 2609 CCCACAAGCTGAAATTGGACAGGTACCTAAATTATTTTGCTACGAAGTCGAGTCCCACTG 2668 

Qy 2615 CCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGG 2674 

II I II Mill II I I I Mill I MIMIII III Mill 

Db 2669 GGGTGATCCTGGATCTCTGGGAAGCCCAGAATTTCCCTGATGGCAACCTGAGCATGCTGG 2728 

Qy 2675 CTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTG 2734 

I M M III I I M II I I II II II I I II I 

Db 272 9 CAGCAGTGCTGGAAGAAATGGGACGACATGAAACCGTTGTTTCTTTGGCAGCAGAAGGAA 2788 

Qy 2735 AGTGCTGAGGCCGGC 274 9 

I I II II M I I 
Db 2789 ATTACTGATGCAGCC 2803 
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M118026 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



ROD 19-DEC-2003 



AB118026 9328 bp mRNA linear 

Rattus norvegicus Unc5h3 mRNA, complete cds . 
AB118026 

AB118026.1 GI: 40217509 



Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 
CDS 



polyA_signal 
ORIGIN 



Kuramoto.T., Kuwamura,M. and Serikawa,T. 

Rat neurological mutations cerebellar vermis defect and hobble are 

caused by mutations in the netrin-1 receptor gene Unc5h3 

Mol. Brain Res. (2003) In press 

2 (bases 1 to 9328) 

Kuramoto,T. and Serikawa^T. 

Direct Submission 

Submitted (22-AUG-2003 ) Takashi Kuramoto, Institute of Laboratory 
Animals, Graduate School of Medicine, Kyoto University; 
Yoshidakonoe-cho, Sakyo-ku, Kyoto 606-8501, Japan 
(E-mail : tkuramot@anim.med.kyoto-u.ac.jp, 
URL: www. anim.med. kyoto-u. ac. jp, Tel : 81-75-753-4494 , 
Fax:81-75-753-4409) 

Location/Qualifiers 
1. .9328 

/organism="Rattus norvegicus" 

/mol_type="mRNA" 

/db_xref="taxon: 10116" 

/ chr omo s ome= " 2 " 

1. .9328 

/gene="Unc5h3" 

151. .2946 

/gene="Unc5h3" 

/ codon_s tart=l 

/protein_id="BAD05181.1" 

/db_xref-"GI : 40217510" 

/translation="MRKGLRATAARCGLGLGYLLQMLVLPALALLSASGTGSAAQDDD 
FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQIYFKCNSEWVHQ 
KDHWDERVDETSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYV 
RIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPMAEVEWLKNEDIIDPVEDRNFYI 
TIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWAEWSVCNSRC 
GRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTH 
WRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQAAPDSDDVALYVGIVIAVTVCLA 
ITVWALFVYRKNHRDFESNIIDSSALNGGFQPVNIKAARQDLLAVPPDLTSAAAMYR 
GPVYALHDVSDKIPMTNSPILDPLPNLKIKVYNSSGAVTPQDDLAEFSSKLSPQMTQS 
LLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRVYE 
MYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVILTLHHCADPNTEDWKIQL 
KNQAVQGQWEDWWGEENFTTPCYIQLDAEACHILTENLSTYALVGQSTTKAAAKRL 
KLAIFGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGSIHN 
LRLSIHDITHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLC 
VRQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSL 
DAPQTRGHDWRMIAHKLNLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLE 
EMGRHET WS LAAE GQ Y " 
2947. .9328 
/gene="Unc5h3" 
8727. .8772 

/note="simple sequence repeats" 
/rpt_unit="gt" 
9309. .9314 
/gene="Unc5h3" 



3'UTR 

repeat_region 



Query Match 35.8%; Score 986; DB 10; Length 9328; 

Best Local Similarity 62.7%; Pred. No. le-155; 

Matches 1651; Conservative 0; Mismatches 920; Indels 63; Gaps 



5; 



163 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAG^ACAAG 222 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
322 CCTGAGCCATTACCACACTTTCTCATTGAGCCCGAGGAAGCTTACATTGTGAAGAACAAG 381 

223 CCAGTGCTGCTTGTGTGC7^GGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGC7\ACGGG 282 

I I I I I II I I I I M I II I I I II II I I I I I I I I I I I I I I I I II I 

382 CCTGTGAACCTGTATTGTAAAGCCAGCCCTGCCACCCAGATCTACTTCAAGTGCAACAGC 441 

283 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 342 

I I I II II I III I II I II II I III II I II I II I I I I 

442 GAATGGGTTCATCAGAAGGACCACGTAGTAGACGAGAGGGTAGATGAAACCTCCGGTCTC 501 

343 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
502 ATTGTGAGAGAAGTGAGCATTGAGATTTCACGCCAGCAGGTGGAGGAACTGTTCGGGCCT 561 

403 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 4 62 

II II I I I I I I I I I I I I I I I I I I I I I I II I M I I II I M M I I II I I I I I I I 

562 GAAGATTACTGGTGCCAGTGTGTGGCCTGGAGCTCAGCGGGCACTACGAAGAGTCGGAAG 621 

463 GCCTACATCCGCATAGCCAGATTGCGCTVAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 522 

II M I I I I I I I I I I I I I I I I I I I II I II I I I I I III I I I I 

622 GCATACGTGCGCATTGCGTATCTGCGGAAGACATTTGAGCAGGAACCCTTGGGAAAAGAA 681 

523 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 582 

I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II 
682 GTGTCCTTGGAGCAGGAAGTGTTACTCCAGTGCCGGCCACCTGAAGGGATCCCAATGGCT 741 

583 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 642 

I I I I I I I I I I I II I I I I I I I I I I Mill I III M I I I I 
742 GAGGTGGAGTGGCTAAAGAATGAAGACATAATTGATCCTGTCGAAGATCGGAATTTTTAT 801 

643 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTAC 7 02 

I I I I I I I I I II I I II I I I I I II M I I I I I II I I II I I 

802 ATCACTATCGATCACAACCTGATCATCAAGCAGGCACGACTCTCAGATACAGCGAATTAT 861 

703 ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTC 762 

I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

862 ACCTGTGTTGCCATW^CATTGTTGCCAAGAGGAAAAGCACCACAGCCACCGTCATCGTG 921 

763 TACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGG 822 

M II II II II I II I I I II I I I I M mil N II I I I I N I M I I 

922 TATGTTAATGGTGGCTGGTCCACCTGGGCAGAATGGTCTGTGTGTAACAGCCGCTGTGGG 981 

823 CGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCT 882 

I I I I I I I I I I I I I I I I I II M I II I I I I I I I I I I I I I I I I I II 

982 CGAGGATATCAGAAACGCACAAGAACCTGCACCAACCCAGCCCCCCTCAATGGTGGGGCC 1041 

8 83 TTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGC 942 

II I I I I I I I I I I I I I III I I I I M I III Ml MM I M M I I I I II Ml 

1042 TTCTGCGAGGGGCAGAGTGTACAGAAAATAGCATGCACCACGTTATGCCCAGTGGATGGC 1101 
943 AGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 1002 

I I I I I I II I II II M I M II I I II II I I I I M II II I II I II M M I 

1102 AGGTGGACTTCGTGGAGCAAATGGTCAACTTGTGGGACTGAGTGCACCCACTGGCGGAGG 1161 
1003 CGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 1062 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



I I M I I II I I I I I I II I I I I I M I I II I I INI I I I I 

1162 AGGGAGTGTACAGCACCAGCCCCCAAGAACGGGGGTAAGGACTGTGATGGCCTGGTCCTC 1221 
1063 GACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTG 1122 

I II I I I M I I I II I I II IN I I I I I I I I I M I I I I I 

1222 CAATCCAAGAACTGCACCGATGGGCTGTGCATGCAGGCTGCTCCTGACTCAGATGACGTT 12 81 

1123 GCCCTCTATGTGGG CCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTC 117 9 

II I I II I I I I I I I I I II M I I I I II I I M I I III 

1282 GCTCTCTACGTTGGGATTGTGATCGCTGTAACCGTTTGTCTGGCAATCACTGTTGTGGTG 1341 

118 0 CTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCC 1239 

II I I II I M I I IN I I I I I II III I I I I I I I I 

1342 GCCCTGTTCGTGTATCGCAAGAACCACCGTGACTTTGAGTCTAATATCATTGACTCCTCA 14 01 



1240 



ATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCAT 12 99 

II I I I II I I I I I I I I I I I I I I I I I I I I I I I II 

1402 GCACTCAATGGCGGCTTTCAGCCTGTGAACATCAAG 



-GCTGCCAGACAAGAT 14 52 



1300 CTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGT 1359 

I I I I I II I I I I I I I I I I I I I III III III I I I I 

1453 CTCCTGGCAGTACCCCCGGATCTCACCTCAGCTGCAGCCATGTACAGGGGACCCGTCTAT 1512 

1360 CCCCGGCAGGATG GGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCA 1414 

II I I I I I I II I I I I I I Ml I I I I I I I I 

1513 GCTCTCCATGATGTCTCAGACAAAATCCCAATGACCAATTCTCCAATTCTAGACCCACTG 1572 



1415 GCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTC TGAGGCC 

II II I I I I I I I I I I I I I I I I II 

1573 CCCAATTTGAAAATCAAAGTGTACAACAGCTCAGGTGCTGTCACTCCTCAGGACGACCTT 



1470 



1632 



1530 



1471 GAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACC 

I I 11 I II III I I I I I I I I I I I II II I 11 

1633 GCTGAGTTCTCATCCAAGCTGTCACCCCAGATGACCCAATCCTTGCTAGAGAATGAGGCC 1692 

1531 AGCAACATGA CCTATGGG 1548 

I I III 

CTGAACCTGAAGAACCAGAGTCTCGCAAGACAGACTGACCCATCCTGCACAGCGTTTGGC 1752 



1693 



1549 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATC 

I I I I I I I I I I I I I I I I I I I I I 1 M I I I I I I I I I I I I I I I I I I I I 
1753 ACCTTCAACTCCCTGGGCGGTCACCTCATCATCCCTAATTCAGGAGTAAGCTTGCTGATT 



1608 



1812 



1668 



1609 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 

I II I I I I I I I I I I I II I I M I I M M I I III I I I I I I 
1813 CCCGCTGGGGCCATTCCTCAGGGGAGAGTCTATGAAATGTATGTGACTGTACACAGGAAA 1872 

1669 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 1728 

III I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I 
1873 GAAAATATGAGGCCCCCCATGGAAGACTCTCAGACCCTGCTGACCCCTGTGGTGAGCTGT 1932 

1729 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 17 88 

I I I I I II II I III I I I I I I I I I I M I I I I II I I I I I I I I I I M 
1933 GGGCCTCCTGGAGCTCTGTTGACCCGTCCTGTCATCCTCACTCTGCATCACTGTGCAGAC 1992 

178 9 CCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAG 1848 
I I I I I I I I I I I I II I M I I I I I II I I I I I I I I I I M 



Db 


1993 


CCTAACACCGAGGACTGGAAGATCCAGCTCAAAAACCAGGCAGTGCAGGGACAGTGGGAG 


2052 


Qy 


1849 


GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 

lllll II 1 1 1 Ml Mini II 

GATGTTGTGGTTGTTGGAGAGGAAAACTTCACAACCCCCTGTTACATTCAGCTGGATGCA 


1908 


Db 


2053 


2112 


Qy 


1909 


AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 

M 1 M 1 1 M M II 1 1 1 1 1 11 11 1 1 1 1 II 1 11 II 1 1 1 
GAGGCTTGCCATATTCTCACGGAGAACCTCAGTACCTATGCCCTGGTTGGGCAGTCCACC 


1968 


Db 


2113 


2172 


Qy 


1969 


AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 

II M M II M II II 1 1 1 II 1 II 1 II 1 1 1 II 1 1 1 II 1 1 II 
ACCAAAGCAGCTGCCAAGCGTCTGAAGCTGGCCATCTTTGGGCCCCTCTGCTGCTCTTCC 


2028 


Db 


2173 


2232 


Qy 


2029 


CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 

M Mill 1 II II 1 II 1 1 II M M M II M 1 II II Mill II Mill II 

CTGGAGTATAGCATCAGAGTCTACTGCCTGGATGACACACAGGATGCCCTGAAGGAAGTT 


2088 


Db 


2233 


2292 


Qy 


2089 


GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 

1 II II M 1 1 1 11 II II M II II II 1 lllll II 1 1 1 1 1 1 
CTACAACTGGAAAGGCAAATGGGAGGACAGCTCCTAGAAGAACCCAAGGCTCTTCATTTT 


2148 


Db 


2293 


2352 


Qy 


2149 


AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 

II 1 II 1 II 1 II II II II II 1 1 II II II II 1 M M II 1 II 1 II 1 

AAAGGCAGCATCCACAACCTGCGCCTCTCAATTCATGACATCACCCATTCCCTCTGGAAG 


2208 


Db 


2353 


2412 


Qy 


2209 


AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 

1 1 II Mil 1 1 1 M II 1 1 1 M 1 1 1 1 1 M 1 II 1 1 II 1 1 1 1 1 1 1 1 
AGCAAATTGCTGGCTAAGTATCAGGAAATTCCATTTTACCACATCTGGAGTGGCTCTCAA 


2268 


Db 


2413 


2472 


Qy 


2269 


CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 

1 II 1 1 M II II II II M II II 1 1 II II M 1 II 1 1 1 II 1 1 

AGAAACCTCCACTGTACCTTCACTCTGGAAAGACTCAGCCTAAACACAGTAGAGCTGGTT 


2328 


Db 


2473 


2532 


Qy 


2329 


TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 

1 1 II 1 II M 1 II M 1 1 II 1 M II II 11 1 11 II M 1 1 1 II 1 1 1 1 
TGCAAACTCTGTGTGCGGCAGGTTGAAGGAGAAGGGCAGATCTTCCAGCTCAACTGTACT 


2388 


Db 


2533 


2592 


Qy 


2389 


ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 
1 1 II M 1 1 1 lllll lllll II II 

GTGTCAGAGGAACCTACTGGTATCGATTTGCCTCTCCTGGACCCTGCTAGCACCATTACC 


2448 


Db 


2593 


2652 


Qy 


2449 


GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 

1 1 II 1 1 11 II 1 II II lllll II II II 1 1 II II 1 M 1 1 
ACTGTCACTGGACCAAGTGCTTTCAGCATCCCTCTTCCTATCCGGCAGAAGCTATGTAGC 


2508 


Db 


2653 


2712 


Qy 


2509 


AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 

II 1 II M 1 1 II 1 1 III II 11 II 1 11 M II M 1 1 1 11 1 1 II 

AGCCTGGATGCCCCCCAAACAAGAGGCCATGACTGGAGGATGCTGGCCCACAAACTTAAC 


2568 


Db 


2713 


2772 


Qy 


2569 


CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 lllll II i 1 1 lllll 1 

1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mil 1 lllll II 1 1 1 lllll 1 

CTGGACAGGTACTTGAATTACTTTGCCACCAAATCGAGCCCGACTGGCGTAATCCTGGAT 


2628 


Db 


2773 


2832 


Qy 


2629 


CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 

II II 1 II 1 1 1 1 II II II II 1 II 1 II 1 1 1 M 1 II II 1 II 1 III 

CTCTGGGAAGCACAGAACTTCCCCGATGGAAACCTGAGCATGCTGGCAGCCGTCCTGGAA 


2688 


Db 


2833 


2892 



Qy 2689 GGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I M II I I I I I I I I I I I I III 

Db 2893 GAAATGGGAAGACACGAAACCGTAGTGTCCTTGGCAGCAGAAGGTCAGTATTGA 2946 
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and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 


2752 


100. 


0 


2752 


6 


ABK37922 


Abk37922 


2 


2687.4 


97. 


7 


2697 


6 


ABK52891 


Abk52891 


3 


2676.4 


97. 


3 


2881 


6 


ABK49422 


Abk49422 


4 


2368.6 


86. 


1 


2907 


4 


AAK52261 


Aak52261 


5 


2356 


85. 


6 


3580 


6 


ABK15169 


Abkl5169 


6 


2259 


82. 


1 


3014 


2 


AAV52940 


Aav52940 


7 


2252.2 


81. 


8 


2697 


6 


AAS16843 


Aasl6843 



cDNA enco 
Human net 
DNA encod 
Human pol 
Human REP 
Rat UNC-5 
Rat netri 



8 


1562. 


4 


56. 


8 


1787 


2 


AAV52941 


Aav52941 


Human UNC 


9 


1304. 


2 


47 . 


4 


9700 


4 


AAC90958 


Aac90958 


Plasmid p 


10 


1206. 


6 


43 . 


8 


1321 


4 


AAH99530 


Aah99530 


Human pro 


11 


970 


35 . 


2 


3646 


5 


AAS75738 


Aas75738 


DNA encod 


12 


Q 

^7 O O . 


o 


34 . 


0 


2860 


6 


ABT06279 


Abt06279 


Human NOV 


13 


J? O vJ • 


> ^ 


34 . 


0 


2860 


6 


ABT06280 


Abt06280 


Human NOV 


14 


y X O . 


. o 


33 . 


2 


2895 


6 


ABQ93898 


Abq93898 


Human tra 


15 


Qn9 

? u ^ < 


A 
. r± 


32. 


8 


3884 


4 


AAS21316 


Aas21316 


Human cDN 


16 


009 




32 . 


8 


3884 


7 


ACD23925 


Acd23925 


Novel hum 


17 


Qn9 


4 

. *1 


32 . 


8 


3884 


7 


ACA67 066 


Aca67066 


cDNA enco 


18 


Q n9 


/I 


32 . 


8 


3884 


7 


AGA03675 


Aca03675 


cDNA enco 


19 


yu^: . 


A 


32 . 


8 


3884 


7 


ABX89213 


Abx89213 


DNA encod 


20 


Q n 9 
y u z , 


/I 
. fi 


32 . 


8 


3884 


7 


ACD41867 


Acd41867 


Human sec 


21 


Q no 


/I 


32 . 


8 


3884 


7 


ACA04096 


Aca04096 


Human cDN 




yuz . 


. fl 


32 . 


8 


3884 


8 


ADA45664 


Ada45664 


Novel hum 


^ o 




/I 


32 . 


8 


3884 


8 


ADA76095 


Ada76095 


Human PRO 


94 


Q n o 
y uz 


>1 
. 4 


32 . 


8 


3884 


8 


ADA18745 


Adal8745 


Human PRO 


25 


Q n 9 
y uz 


A 


32 . 


8 


3884 


8 


ADA61368 


Ada61368 


Homo sapi 


9fi 


n o 

yuz 


/I 


32 . 


8 


3884 


8 


ADB19153 


Adbl9153 


Novel hum 


97 


Q n o 
yuz 


• ft 


32 . 


8 


3884 


8 


ADB27694 


Adb27694 


cDNA enco 


9fi 
z, o 


Q n o 

yuz 


/I 

. H 


32 . 


8 


3884 


8 


ADAS 6173 


Ada86173 


Novel hum 


9 Q 


yuz 


A 
. fi 


32 . 


8 


3884 


8 


ADB15737 


Adbl5737 


Human PRO 




Q n o 
y Uz 


A 

• 4 


32 . 


, 8 


3884 


8 


ADA47523 


Ada47523 


Human PRO 


O X 


Q n o 

yuz 




32 . 


, 8 


3884 


8 


ADA67318 


Ada67318 


Human PRO 


'^9 


Q n o 

yuz 


/I 
. fi 


32 . 


. 8 


3884 


8 


ADB30325 


Adb30325 


cDNA enco 


33 


Q n o 

yuz 


. fi 


32 . 


. 8 


3884 


8 


ADAS 5 621 


AdaS5621 


Novel hum 




Q n o 

yuz 


A 

. ft 


32 , 


, 8 


3884 


8 


ADA96833 


Ada96833 


Human PRO 




yuz 


A 
. fi 


32 . 


. 8 


3884 


8 


ADA79137 


Ada79137 


Human PRO 




902 


, 4 


32 , 


. 8 


3884 


8 


ADA87276 


Ada87276 


Novel hum 


o / 


902 


.4 


32 , 


. 8 


3884 


8 


ADB16478 


Adbl6478 


Human PRO 


o o 


902 


.4 


32 , 


. 8 


3884 


8 


ADA91570 


Ada91570 


Novel hum 


39 


902 


.4 


32 , 


. 8 


3884 


8 


ADB14633 


Adbl4633 


Human PRO 


40 


902 


.4 


32, 


.8 


3884 


8 


ADB18594 


Adbl8594 


Novel hum 


41 


902 


.4 


32 


.8 


3884 


8 


ADA93809 


Ada93809 


Human PRO 


42 


902 


.4 


32 


.8 


3884 


8 


ADB19705 


Adbl9705 


Novel hum 


43 


902 


.4 


32 


.8 


3884 


8 


ADB13017 


Adbl3017 


Human PRO 


44 


902 


.4 


32 


.8 


3884 


8 


ACD98496 


Acd98496 


Novel hum 


45 


902 


.4 


32 


.8 


3884 


8 


ADA74271 


Ada74271 


Human PRO 



ALIGNMENTS 



RESULT 1 

ABK37922 

ID ABK37922 standard; cDNA; 2752 BP. 
XX 

AC ABK37922; 
XX 

DT 21-MAY-2002 (first entry) 
XX 

DE cDNA encoding Human protein NOVl . 
XX 

KW Human; NOVX; ss; gene; cardiomyopathy; atherosclerosis; diabetes; 

KW cell signal processing disorder; metabolic disorder; obesity; infection; 

KW anorexia; cancer-associated cachexia; cancer; neurodegenerative disorder; 

KW Alzheimer's disease; Parkinson's disease; immune disorder; 



KW haematopoietic disorders; dyslipidaemia; pain; asthma; hypertension; 

KW osteoporosis; Crohn's disease; multiple sclerosis; angina pectoris; 

KW myocardial infarction; ulcer; allergy; benign prostatic hypertrophy; 

KW psychosis; neurological disorder; anxiety; schizophrenia; 

KW manic depression; dementia; dyskinesia; Huntington* s disease; 

KW Gilles de la Tourette^s syndrome; gene therapy. 

XX 

OS Homo sapiens. 
XX 

PN WO200210216-A2. 
XX 

PD 07-FEB-2002. 
XX 

PF 30-JUL-2001; 2001WO-US024225 . 
XX 

PR 28-JUL-2000; 2000US-0221409P . 

PR 04-AUG-2000; 2000US-0222840P . 

PR 04-AUG-2000; 2000US-0223752P . 

PR 04-AUG-2000; 2000US-0223762P . 

PR 04-AUG-2000; 2000US-0223769P . 

PR 04-AUG-2000; 2000US-0223770P . 

PR 14~AUG-2000; 2000US-0225146P . 

PR 15-AUG-2000; 2000US-0225392P . 

PR 15-AUG-2000; 2000US-0225470P . 

PR 16-AUG-2000; 2000US-0225697P . 

PR Ol-FEB-2001; 2 001US-0263662P . 

PR 05-APR-2001; 2001US-0281645P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Padigaru Mezes P, Mishra V, Burgess C, Gasman S, Grosse WM; 

PI Alsobrook JP, Lepley DM, Gerlach VL, Macdougall JR, Smithson G; 
XX 

DR WPI; 2002-180074/23. 

DR P-PSDB; AAU85403. 
XX 

PT New isolated cytoplasmic, nuclear, membrane bound, or secreted 

PT polypeptide, useful for treating cardiomyopathy, atherosclerosis, 

PT infections, cancer, neurodegenerative, metabolic, hematopoietic and 

PT immune disorders. 
XX 

PS Claim 9; Page 9-10; 213pp; English. 
XX 

CC The invention relates to an isolated cytoplasmic, nuclear, membrane 

CC bound, or secreted polypeptide (NOVX, x= 1-14) their variants or mature 

CC form. Also included are the nucleic acids encoding the NOVX proteins, a 

CC vector comprising the nucleic acid, a cell comprising the vector, an anti 

CC -NOVX antibody and modulators of NOVX. NOVX, the nucleic acid and the 

CC antibody are useful for treating or preventing a NOVX-associated 

CC disorder, where the disorder is selected from cardiomyopathy, 

CC atherosclerosis, diabetes, a disorder related to cell signal processing 

CC and metabolic pathway modulation, metabolic disorders, obesity, 

CC infectious disease, anorexia, cancer-associated cachexia, cancer, 

CC neurodegenerative disorders, Alzheimer's disease, Parkinson's disease, 

CC immune disorders, haematopoietic disorders, and the various 

CC dyslipidaemias, metabolic disturbances associated with obesity, the 

CC metabolic syndrome X and wasting disorders associated with chronic 



CC diseases, bacterial, fungal, protozoal and viral infections, pain, 

CC bulimia, asthma, hypertension, urinary retention, osteoporosis, Crohn's 

CC disease, multiple sclerosis, Albright Hereditary Osteodystrophy, angina 

CC pectoris, myocardial infarction, ulcer, allergy, benign prostatic 

CC hypertrophy, and psychotic and neurological disorders, including anxiety, 

CC schizophrenia, manic depression, delirium, dementia, and dyskinesias, 

CC such as Huntington's disease and Gilles de la Tourette's syndrome. The 

CC nucleic acid is useful in gene therapy. The present sequence encodes a 

CC NOVX protein 

XX 

SQ Sequence 2752 BP; 505 A; 937 C; 829 G; 481 T; 0 U; 0 Other; 

Query Match 100.0%; Score 2752; DB 6; Length 2752; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2752; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

M I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I 

Db 1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

Qy 61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

I I I I I I I I I I I M I I I I I I M I I I I I I I I M M I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 61 GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

Qy 121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I M 

Db 121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

Qy 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I M I I I I I I I I I M I I I I I M I M 

Db 181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

Qy 241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

I I I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I M I M I I I I M I I I I I I I I I I I I I I 

Db 241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

Qy 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

M I I I I I I I I I I I I M I I I I M I I I I I M I I I M I I I I I I I I I I I I I I I I 1 I I I I I I M I 

Db 301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

QV 361 ATT7KATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

I I I I I I I M I I I I I I I I I M M I I I I I M I I I I I I I I I I I I I I M I I M I I I 

Db 361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

Qy 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

M I I I I I 1 I I M I I I I I I I I I I I I I I M I I 1 I I M I I I I I I I I I I I I I I I M I I I I M M 

Db 421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

Qy 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 540 

I I I I M I I I I I M I M I I M I I I I I I I I I M I I I I I M I I I M I I I I I I I M I I 

Db 481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 540 

Qy 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

IIIIIMIIIIIIIMMIIIIIMMIIIMIIIIIIIMMIIIIIIIIIIIMIIM 
Db 541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 



Qy 



601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 



601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 
661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 

IIMIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIMIIIMN 

661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 

721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 7 80 

I I I I I I I I I M 1 I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 

781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 840 

M I I I I I I I I I I I I I I I I I I I I I M I I M I I M M I I I I I I I I I I I I I I I I I I I I M I I I 

781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 840 
841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 900 

I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I M M I M I I I I I I I I I I I I I I I 

841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 900 
901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 

I I I I I I I I M M I M I I I M I I I I I M I I I I I I I I I I I I I M I I M M I I I I I I I I I I I I 

901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 
961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 102 0 

I I I I I I I I I I M I I M I I 1 I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 1020 
1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 1080 

I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I I I I 

1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 1080 

1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 1140 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 1140 

1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 

I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 
1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I M I I I i I I I I I I I I I I I I I I I I M I 

1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 1260 
1261 CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 

I I I M M I M I I I I I I I I I M M M I I M I I I I I I I I I I M I I I I I M I I I I I M I I I M 

1261 CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 
1321 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 1380 

I I I I I I I I I I I I I M I I I M I I I M I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I 

1321 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 1380 
1381 CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 1440 

I I I I I M I I I I I I I I M I I I M I I I I I I I M I I I I I I I M I I I I I I M I I I I I I I I I I M 

1381 CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 144 0 
1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

I I I M I M I M I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I M M I I N I I I I I 



1441 CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 
1501 AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 

IIIIIMIIIIIIIIMIIMIIIIMMIIIIIMIIMIMIIIIIMMIIIIIIII 

1501 AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 
1561 CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I M I I I I I 

1561 CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 1620 
1621 ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 1680 

I I I I I I M I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I M I I I I I I I I I M I I I I I 

1621 ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 1680 

1681 TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
1681 TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

1741 GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 18 00 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I 
1741 GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 1800 

1801 AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

I I I I I I M I M I I I M I I I I I I I I I I I I M I I I I M I I I I I I I M I I I I I I I I I I I I I I I 

1801 AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 
1861 CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 1920 

I MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIMIM 

1861 CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 1920 
1921 GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 

M I I I M I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

1921 GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 
1981 GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

I I I I M I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I M I I I M M I I I I I I I 

1981 GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

2041 ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I N I M I N I I I I I I I I I I i I I I I 
2 041 ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

2101 AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

IIIIMIIIMMIMIMMIIIIMIMMIIIMIIMIMIIIIIIIIIMIIMI 

2101 AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

2161 CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

I I I I I I I I I I I I I M I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2161 CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

2221 GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

IIIIMIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIMMIIIIIIIIII 
2221 GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

2281 TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 

I I I I I I M I I I I I M I I I I I I I M I M I M I I I I I I I I I I I I M I I I I I I M I I I I I I I I 

22 81 TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 



Qy 2341 GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 2400 

IIIIIIIIIIIIIIIIIIIIIIIIIIMMIMIIIMIMMIMMIIIMIIMill 
Db 2341 GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 2400 

Qy 2401 ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 24 60 

I M I I I M I I I I I I M I I I I M I I I I I I I I I I M I M I I M I I I M I I I I I I I I 

Db 2401 ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 24 60 

Qy 2461 CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

I I I I I I I I I I M I I I M M I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I M I I I 

Db 2461 CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

Qy 2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 2580 

I I I I I I M I I I I I I M I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I M I I I I I I I 

Db 2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 2580 

Qy 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

I M I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M 

Db 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

Qy 2641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

I M I I I I I I I I I I I M I I I I I M I I I M M I I I I I I I I I I I I I M I I I I I I I I I I 

Db 2641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

Qy 2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

M I I I I M I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I M I 

Db 2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 



Ves 



:ESULT 2 

ABK52891 

ID ABK52891 standard; DNA; 2697 BP. 
XX 

AC ABK52891; 
XX 

DT 27-AUG-2002 (first entry) 
XX 

DE Human netrin binding membrane receptor UNC5H-1 DNA sequence #1. 



XX 
KW 



Netrin binding membrane receptor; receptor; UNC5H-1; gene; ds ; human; 

KW nootropic; neuroprotective; cytostatic; antiparkinsonian; 

KW cerebroprotective; cancer; central nervous system; CNS; stroke; 

KW Parkinson's disease; multiple sclerosis; Alzheimer's disease. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .2697 

FT /*tag= a 

/product= "Netrin binding membrane receptor UNC5H-1" 

XX 

PN WO200233080-A2. 
XX 

PD 25-APR-2002. 

XX 

PF 15-OCT-2001; 2001WO-EP0118 91 . 



XX 

PR 16-OCT-2000; 2OOj0tS"O24OO61P, 
XX 

PA (FARB ) BAYER AG. 
XX 

PI Koehler RH; 
XX 

DR WPI; 2002-463314/49. 

DR P-PSDB; AAU97899. 
XX 

PT Novel human netrin binding membrane receptor polypeptide and 

PT polynucleotides for identifying modulating agents useful in treating 

PT diseases e.g. Parkinson's disease, multiple sclerosis, stroke, 

PT Alzheimer's disease. 

XX 

PS Claim 1; Fig 1; 94pp; English. 



XX 
CC 



This invention relates to the DNA and protein sequences of a novel 

CC purified human netrin binding membrane receptor, UNC5H-1. The DNA 

CC sequence of the invention is useful as a probe for detecting a nucleic 

CC acid encoding the UNC5H-1 protein in a biological sample. The sequences 

CC of the invention are useful to screen for agents which decrease the 

CC activity of the UNC5H-1 protein. The sequences are also useful for 

CC screening agents which regulate (modulate) the activity of the protein of 

CC the invention. A pharmaceutical composition containing the protein of the 

CC invention or a reagent that modulates the activity of the UNC5H-1 protein 

CC may be useful for treating a UNC5H-1 dysfunction related disease such as 

CC cancer or a central nervous system (CNS) disorders (e.g, Parkinson's 

CC disease, multiple sclerosis, stroke and Alzheimer ' s disease). Fusion 

CC proteins comprising the UNC5H-1 protein are useful for generating 

CC antibodies and for in various assay systems, and the protein can be used 

CC as a bait protein in a two-hybrid assay or three-hybrid assay. The method 

CC of the invention is useful for detecting a coding sequence for the UNCSH- 

CC 1 protein. The present sequence represents a DNA sequence encoding the 

CC human netrin binding membrane receptor UNC5H-1 protein of the invention 
XX 

SQ Sequence 2697 BP; 503 A; 906 C; 807 G; 481 T; 0 U; 0 Other; 

Query Match 97.7%; Score 2687.4; DB 6; Length 2697; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 2691; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I M I I I I I I I M I I I M I I I I I I M I I I I I I I I I I I M M I I I I M I I MUM 

ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 60 
CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

I M M M M M M M M M M M M M M M M M M M M M M M M M M M M M I 

CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 120 
GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

I I I M M M M M M M M M M M M M M M M M M M M I M M M M M M M M 

GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 180 
GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 285 

I I I M M M M M M M M M M M M M M M M M M M M M M M M M M M M I 



Qy 


46 


Db 


1 


Qy 


106 


Db 


61 


Qy 


166 


Db 


121 


Qy 


226 


Db 


181 



286 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

I I I I I I I I I I I I I I I I I I M I I I I I I M I I I M I I I I I I M I I I I I I M I I I I I I I M I I 

241 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 300 
346 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 4 05 

I I I I I I I I I I I I I I I M I I M M M M I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 360 
406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 4 65 

I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I M I I I I M I I I I I I M I I I I I I I I I I 

361 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 420 
466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

I I I M I I I M I I I I I I I I M I I I I I I I I I I I I I M I I I I M I I I I I I I I M I I I I I I 

421 TACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 4 80 
526 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

M I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I N I I M I I I I I I I 

481 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 540 
586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

M I I I I M I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I M I I I M 

541 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 600 
646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 705 

I M I I I I I I I I I M I I I I I M M I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I 

601 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 660 

7 06 TGCGTGGCCAAG7WVCATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

I I I I I I I I I I I I I M I I M I M I I I I I M I I I M I I I I I I I I M I I I I I I M I I 

661 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 72 0 
766 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

I I I I I I I I I I I M I I I I I M I M M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I 

721 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 780 
826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 885 

M I I I I I M I I I I I I I I I I I I M I I I I I I I I M I I M I I M I M M I I I I I M I I I I I I I 

781 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 84 0 

8 86 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I 1 M 

841 TGTGAGGGGCAGAATGTCCAGAA?JVCAGCCTGCGCCACCCTGTGCCCAGTGGACGGCAGC 900 

946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

I I M I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I M I M I I I I M I I I I I I I I 

901 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 960 
1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

I I M M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I 

961 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1020 
1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

I I I M I I I I I M I I M I I I M M I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I M I 

1021 ACCCGCAACTGTACCAGTGACCTCTGTGTACACACTGCTTCTGGCCCTGAGGACGTGGCC 108 0 



1126 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

M M I I I I I I I M I I I I I I I M M I I I I I I I I M M M I I I I I I I M I I I I I M I 

1081 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1140 
1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

I I I M I I I I I I M I I I I I M I M M I I I I I I I I I I M I M I I I I 1 I I I M I I I 

1141 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1200 

1246 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I M I I I I I I I I I 

1201 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1260 
1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I I I I I I I I I I I I I I I I I M 1 I I I I I I M I I I I I I I I I I I I I I I I I I M I I 1 I 1 I M I I I I 

1261 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1320 
1366 CAGGATGGGCCCAGCCCC7\AGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I 

1321 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1380 
1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I I I I I M I I I I I I I I I I I I I M I M I I I I M I I I I M M I I I 1 I 1 I I I I I I I I I M I I I I 

1381 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1440 
1486 CGCCTCTCCACCCAGT^CTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I 

1441 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1500 
1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I I I I M I I I I M I I I I I I I I I I I I I I I I I I M M I M I I M I I I I I I I I I I I I I I I I I I 

1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTC 1560 
1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

I I I I I M I I I I I I I I I I I M I I I I I I I I I I I M M I M I M M I I I I I I I I I I I I I I I M 

1561 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1620 
1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

I I I I I M I I I I I I I I I I I I I I I I M I M M M I M I M I I I I I I I I I I 1 I M M I I I I M 

1621 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1680 
172 6 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

M I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I M I I I I I M I I I 

1681 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 17 40 

1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I M 1 I I I I I I I I I I I I M I 
1741 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1800 

1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I M I I I I 

1801 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1860 

1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

I I I I I I M I I I I I I I I M I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I 11 I I I I I M N I 
1861 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1920 

1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 



Db 


1921 


Qy 


2026 


Db 


1981 


Qy 


2086 


Db 


2041 


Qy 


2146 


Db 


2101 


Qy 


2206 


Db 


2161 


Qy 


2266 


Db 


2221 


Qy 


2326 


Db 


2281 


Qy 


2386 


Db 


2341 


Qy 


2446 


Db 


2401 


Qy 


2506 


Db 


2461 


Qy 


2566 


Db 


2521 


Qy 


o /r o /C 

2626 


Db 


2581 


Qy 


2686 


Db 


2641 



M I 1 I M I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 1980 
TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

I I I M M M I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M 

TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 204 0 

GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

I I I 1 I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2100 

TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I M I 

TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2160 

AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 22 65 

I I I I I I I I I I I M M I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2220 

CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

I I I I I M I I I I I I M I I I I M I I I I I I I I M I I I M I I I I M I I I I I I I I I I I I I I I M I 

CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 22 80 
GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

I I I I I M I I I I I M I I I I I M I I I I I I I M I I I M I I M M I I I M I I I I I I I I M I I I I 

GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2340 
AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I M M I M I I I I I I I I I I I M I I I 

AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2400 

CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 

I I I I I I I I M I I I I I I I I I I I M I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2460 

TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCI3GACTCTGGCCCAGAAACTC 2565 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 252 0 
CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2 625 

I I I I I I I I I M I M I M I I I I I I I I I I I I I I I I I I I I I M M I I I I I I M I I I I I I I I I I 

CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2580 
AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2685 

I I I M I I I I I I I M I I I I I M I I I I M I I I I I I I M I I I I I M I I I I I I M I I I I I I I I I 

AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2640 

GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I 
GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2697 
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XX 
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XX 
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XX 
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XX 

KW Human; NOVX polypeptide; cardiomyopathy; atherosclerosis; cancer; 

KW cell signal processing; metabolic pathway modulation; cancerous tissue; 

KW antibody; diabetes; transgenic animal; UNC5-like protein; NOVl; 

KW chromosome 13; gene; ds . 

XX 
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XX 

FH Key Location/Qualifiers 
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XX 
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XX 

PT Novel isolated NOVX polypeptide, and encoded polynucleotide, useful for 

PT treating cardiomyopathy, artherosclerosis, and cancer. 

XX 

PS Claim 8; Page 7-8; 180pp; English. 
XX 

CC The present invention relates to a new NOVX polypeptide having a 900 

CC (NOVl), 4349 (N0V2) , 940 (N0V3) , 798 (N0V4), 865 (N0V5), or 331 (NOV6) 

CC residue amino acid sequence, as given in the specification. The novel 

CC polypeptide, and its encoding polynucleotide, are used to treat 

CC cardiomyopathy, atherosclerosis, cancer or a disease related to cell 

CC signal processing and metabolic pathway modulation, in a human. Detecting 

CC the polypeptide or polynucleotide is useful for identifying cancerous 

CC tissue. The antibody can be used to treat diabetes or cancer. The host 

CC cells can be used to produce non-human transgenic animals useful in drug 

CC screening. The present nucleic acid sequence is that of the human UNC5- 

CC like NOVl gene located on chromosome 13. This sequence encodes the human 

CC UNC5-like protein NOVl of the invention 

XX 

SQ Sequence 2881 BP; 526 A; 985 C; 868 G; 502 T; 0 U; 0 Other; 

Query Match 97.3%; Score 2676.4; DB 6; Length 2881; 
Best Local Similarity 98.9%; Pred. No. 0; 

Matches 2728; Conservative 0; Mismatches 21; Indels 9; Gaps 3 




Qy 



1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 



Db 


42 


CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 


101 


Qy 


61 


GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 

1 M M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M M M M 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 

GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 


120 


Db 


102 


161 


Qy 

Db 


121 
162 


CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCC7WVCCCGGACCTGCTTCCCCAC 


180 
221 


Qy 


181 


TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 


240 


Db 


222 


281 


Qy 
Db 


241 

282 


AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 
AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 


300 
341 


Qy 


301 


GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 III II 1 II II 1 1 II 1 1 1 1 

GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGTGAGCCGACCATGGAGGTCCGC 


360 


Db 


342 


401 


Qy 

Db 


361 
402 


ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 

1 II II II II 1 II 1 II M II 1 II II II 1 1 1 1 1 1 II 1 II II 1 II M II II 1 1 1 1 1 1 1 1 II II 
ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 


420 
461 


Qy 


421 


TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 

1 1 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 M 1 1 1 1 1 1 

TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 


480 


Db 


462 


521 


Qy 


481 


AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 
II 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II II II II II II 1 II II II 
AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 


540 


Db 


522 


581 


Qy 


541 


ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 

M M II 1 1 II 1 1 1 1 1 1 1 II 1 1 II II 1 M 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 II 1 1 1 1 II M 1 M 1 1 

ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 


600 


Db 


582 


641 


Qy 


601 


AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 


660 


Db 


642 


701 


Qy 


661 


CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 II M 1 1 1 1 1 II 

CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCT^GAAC 


720 


Db 


702 


761 


Qy 


721 


ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 


780 


Db 


762 


821 


Qy 


781 


TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 

1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 

TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 


840 


Db 


822 


881 


Qy 


841 


AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 


900 



I I I I I M I I I I I M I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I M 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



882 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 941 



901 



942 



GTCCAGAA— AACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGG 

I I I I I I I I I I I I I II I I II I II I II M I I I I I I I I I II II 

GTCCATGACCGCACCGTCTCCTCTCTGCTTGTCTCTGTGGACGGCAGCTGGAGCCCGTGG 



958 AGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGAC 

I M II II II I I I I I II I M I I M I I I I I I I I I I M I II I I I I I M I I I I I I I I I II I I I I 
1002 AGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGAC 



957 



1001 



1017 



1061 



1018 CCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGT 1077 

I I I I I I I M I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I II 
1062 CCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGT 1121 

1078 ACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGC 1137 

I I I I I I I I M I I I I I I I I I II I I I I II I I I I II II I I I I I M I I I II I I I I I II I I I I I I 
1122 ACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGC 1181 

1138 CTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGC 1197 

I I I I 1 I I I I I I I I I M I II I I I I I I I I I I I II I I I I I I I I II M I I I I I I I I I I I I I I I I 
1182 CTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGC 1241 

1198 CGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTC 1257 

I I I M I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I 

12 42 CGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTC 1301 



1258 



CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCG 1317 

I I I I I I I I I M I M I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I 

1302 CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCG 1361 



1318 GACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCC 1377 

I I I I I I I I I I I I II II I I I I II I I I I M I I I II M I I I I I I I I I M I II I I I I I I I I 

1362 GACCTCAG CACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCC 1418 

1378 AGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCAC 1437 

II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I M I I I I II I I I I I I I I I 
1419 AGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCAC 1478 

1438 ACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACC 1497 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I 

1479 ACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACC 1538 

1498 CAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAAC 1557 

I I I I I M I I I I II I I M I I I I I I II I I I I I II I I I I I I I II I M I I I I I I I I I I I I I I I I 
1539 CAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAAC 1598 

1558 TTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGAT 1617 

I I I I I I I I M I I II I I I I I II I II I I II I II I I I I II I I II M I M I M II I II I I I I I I 

1599 TTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGAT 1658 

1618 GCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1677 

I II I I I I I I M I I I I I I M I I I I I I II I I I I I I I I II I I I I I I I I II II I I II I II I I II 
1659 GCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1718 

1678 AGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCT 1737 

I I I I I I I I I I I I I M M M I I M I II I II I I II I I M I I I I M I I I I I I I I I II I I I I I I 

1719 AGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCT 177 8 



Qy 


1738 


Db 


1779 


Qy 


1798 


Db 


1839 


Qy 


1855 


Db 


1899 


Qy 


1915 


Db 


1959 


Qy 


1975 


Db 


2019 


Qy 


2035 


Db 


2079 


Qy 


2095 


Db 


2139 


Qy 


2155 


Db 


2199 


Qy 


2215 


Db 


2259 


Qy 


2275 


Db 


2319 


Qy 


2335 


Db 


2379 


Qy 


2395 


Db 


2439 


Qy 


2455 


Db 


2499 


Qy 


2515 


Db 


2559 



GGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCT 1797 

M I I I I I I I M I I I M I I I I I I I I I I I I I M I I I I I I I I I M I M I I I M I I I I I I I I M 

GGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCT 1838 

GACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGG AGGATGTG 1854 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I III 

GACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGCAGGATGTG 1898 

CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCC 1914 

II I II II I II II II I II II I I I II II II I II I II II II II M II I II II II I II II II II 
CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCC 1958 

TGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTG 1974 

I I II II II I II II I II I II II II M II M I II I II II M I M II II I I I II I M I I I II I 

TGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTG 2018 
GCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAG 2034 

M I I I I I M I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

GCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAG 207 8 
TACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAG 2 094 

I I I I I I I M I I I I I I I I I I M I I I I I I I I M I I II I I M I I I I I I I I I I I I I 11 II M I I 

TACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAG 2138 

CTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGAC 2154 

II I I I I I I I I I I M I I I I I I I I I I I I I I II I II I I I I I M I I I I I I I I I I I I II I I I II I 
CTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGAC 2198 

AGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAG 2214 

I I I I I I I I I I I I I M I M I I I I I I II I I II I I I I I I M I I I I M M I I I I I I I I I I II I I 

AGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAG 2258 
CTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTAC 2274 

I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M M I I I I II I I I I I I I I I I I I 

CTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTAC 2318 

TTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAG 2 334 

I I M I I I M I I I I I I I I M I I I I I I II I II I I I I I I I I I II I I I I M I I I I I I I I I I II I 
TTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAG 2378 

CTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACC 2394 

I I I I I I I I I I M M I I I I I M M I I M I I I I I I I I I I I I I M I I I I I I I I I II M I I I I I 

CTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACC 2438 

AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTG 2454 

I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I II I I I I I I I M I 11 I I I I I I I I I I I I 
AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTG 2498 

GTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTG 2514 

I I II M I II M I I I I I I I I I I I I I M M I I I I I I I I I I M I I I I I M I I I I I I I I I I I M 

GTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTG 2558 
GACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGAC 2574 

I I I I I I I I I I I I I I I I I I M I I I M M I I I I M I M I I I M I I I I M I I I M I I I I I I I I 

GACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGAC 2618 



Qy 2575 AGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGG 2634 

I M M I I I I I I M I M I I I I I I I I I M I I I I M I I I I M M I I I I I I I I I M I I M M I I 

Db 2619 AGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGG 2678 

Qy 2635 GAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTG 2 694 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 1 I I I I I I M I M I I I M I I I I I I I M I I I I 
Db 2679 GAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTG 2738 

Qy 2 695 GGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

I I I I I I I I I I I I I M I M I I I I M I I I M M I I I I I I I I I M I I I I I M I I I I M I I I 

Db 2739 GGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 27 96 
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PS Claim 1; Page 2691-2694; 6221pp; English. 
XX 

CC The invention relates to polynucleotides (AAK5 145 6-AAK5 3435) and the 

CC encoded polypeptides (7^78323-AAM80302 ) that exhibit activity elating to 

CC cytokine, cell proliferation or cell differentiation or which may induce 

CC production of other cytokines in other cell populations. The 

CC polynucleotides and polypeptides are useful in gene therapy, vaccines or 

CC peptide therapy. The polypeptides have various cytokine-like activities, 

CC e.g. stem cell growth factor activity, haematopoiesis regulating 

CC activity, tissue growth factor activity, immunomodulatory activity and 

CC activin/inhibin activity and may be useful in the diagnosis and/or 

CC treatment of cancer, leukaemia, nervous system disorders, arthritis and 

CC inflammation. Note: Records for SEQ ID NO 2110 (7UVK52581), 2111 

CC (AAK52582) and 3666 (AAM80020) are omitted as the relevant pages from the 

CC sequence listing were missing at the time of publication 

XX 

SQ Sequence 2907 BP; 552 A; 966 C; 881 G; 508 T; 0 U; 0 Other; 

Query Match 86.1%; Score 2368.6; DB 4; Length 2907; 

Best Local Similarity 93.0%; Pred. No. 0; 

Matches 2587; Conservative 0; Mismatches 19; Indels 177; Gaps 3; 

TCGCCGCTTGGCTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGC 151 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
TGGCGGGCAGACAGCACGGATGGAGCGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGC 229 

CTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCG 211 
I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M 
CTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCG 289 

TCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCA 271 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

TCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCA 349 

AGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGA 331 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
AGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGA 409 

GCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGG 391 

I I M M I I I I I I I I I I M I I I I I I I I I I I I I I I M M I I I I I I I I I I M I I I I I I M I I I 

GCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGG 469 

TGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCA 451 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
TGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCA 52 9 

AGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGC;JftjbAACTTCGAC^AGGAGCC 511 

I M I I I I I I I I I M M I I I M I I I I M I I I M I I I l| I I I I I I M I I I I M I I I M I 

AGAGTCAGAAGGCCTACATCCGCATAGCCTATTTGCGCAkcAACTTCGAGCAGGAGCCGC 589 

TGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCA 571 

I I I I I I I I I I I I I M M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
TGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCA 649 

TCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACC 631 

I I I I I M I I I I I I I I I I I I M I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACC 709 



Qy 


92 


Db 


170 


Qy 


152 


Db 


230 


Qy 


212 


Db 


290 


Qy 


272 


Db 


350 


Qy 


332 


Db 


410 


Qy 


392 


Db 


470 


Qy 


452 


Db 


530 


Qy 


512 


Db 


590 


Qy 


572 


Db 


650 



Qy 


632 


Db 


710 


Qy 


692 


Db 


770 


Qy 


752 


Db 


830 


Qy 


766 


Db 


890 


Qy 


785 


Db 


950 


Qy 


845 


Db 


1010 


Qy 


905 


Db 


1070 


Qy 


965 


Db 


1130 


Qy 


1025 


Db 


1190 


Qy 


1085 


Db 


1250 


Qy 


1100 


Db 


1310 


Qy 


1160 


Db 


1370 


Qy 


1 o o o 


Db 


1430 


Qy 


1280 


Db 


1490 



CCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACA 691 

I I I I I I I I I I I I M I I I I I I I I M I I M I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I 

CCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACA 769 

CGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTG 751 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I M I I I I I I I I I I 

CGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTG 829 

CTGTCATCGTCTAC 765 

I I I I I I I I I I I I I I 

CTGTCATCGTCTACGGTGGGCCCCGGGACTCCCTGGTCACAGGGAGAGGCACTGCGGTGC 889 

GTGAACGGTGGGTGGTCGA 7 84 

I I I I I I I I M I I I I I I I I I 
CCCTGGGCAGTGACATGTGGCTGTCCTTCTCTGTCCGGCCAGTGAACGGTGGGTGGTCGA 94 9 

CGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCC 844 

I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I M I I I I I I 

CGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCC 1009 

GGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCC 9 04 

I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
GGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCC 1069 

AGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGT 964 

I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I M M I I I I I I I I I I I I I I I I I I I 
AGAAAACAGCCTGCGCCACCCTGTGCCCAGTGGACGGCAGCTGGAGCCCGTGGAGCAAGT 1129 

GGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCAC 1024 

I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M 
GGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCAC 1189 

CCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTG 1084 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
CCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTG 1249 

ACCTCTGTGTACACA 1099 

I I I I I I I I M M I I I 

ACCTCTGTGTACACAACTCCTACACCCCTGCCCCCACCAAGGCCATGCTGTCTCCCGCAG 1309 

GTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCC 1159 

M I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCC 1369 

TGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACT 1219 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I 
TGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACT 1429 

CAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
CAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1489 

GCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCA 1339 

M I I M I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I 
GCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCA 1549 



Qy 


1340 


Db 


1550 


Qy 


1400 


Db 


1610 


Qy 


1460 


Db 


1670 


Qy 


1520 


Db 


1730 


Qy 


1580 


Db 


1790 


Qy 


1640 


Db 


1850 


Qy 


1700 


Db 


1888 


Qy 


1760 


Db 


1925 


Qy 


1820 


Db 


1985 


Qy 


1880 


Db 


2045 


Qy 


1940 


Db 


2105 


Qy 


2000 


Db 


2165 


Qy 


2060 


Db 


2225 


Qy 


2120 


Db 


2285 


Qy 


2180 



CCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I 

CCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCA 1609 
ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCA 1459 

I M I M I I I M I I M I M I I I I I I I I I I I I I M I I M I I I I I I I I I M I I I I I I M I I M 

ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCA 1669 
CCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGC 1519 

I I I I I I M I I I I I I I I I I I I I I M I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 

CCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGC 172 9 
CCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGA 1579 

I I I M I I M I 1 I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGA 1789 
TCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCT 1639 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

TCCCTAATACAGGAATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCT 1849 

ATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTC 1699 
I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I M I I 

ATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1887 

AGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAG 1759 

I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

AGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAG 1924 

TCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCA 1819 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I 
TCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCA 1984 

AAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCT 1879 

I I I I M I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I 

AAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCT 2044 

CCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGG 1939 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I M 
CCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGG 2104 

GCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGC 1999 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGC 2164 

TTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGC 2059 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I M M I I I I 
TTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGC 2224 

ATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGC 2119 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
ATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGTyVGCAGCTGGGGGGACAGC 2284 

TGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCA 2179 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I M I I I I I I I I I I I I I I 
TGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCA 2344 

2180 TCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCC 2239 



Db 


2345 


Qy 


2240 


Db 


2405 


Qy 


2300 


Db 


2465 


Qy 


2360 


Db 


2525 


Qy 


2420 


Db 


2585 


Qy 


2480 


Db 


2645 


Qy 


2540 


Db 


2705 


Qy 


2600 


Db 


2765 


Qy 


2660 


Db 


2825 


Qy 


2720 


Db 


2885 



MMMIIIIIIIIIMIIIMMIIMIMMIIIIMIIMMIIIIIIIIIIMIII 

TCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCC 2404 
CCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGC 2299 

I I I I I I M I M I I I I I I M I I I I I I I I I I I M I M I I I I I I I I I I I M I I I I I I I I I I M 

CCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGC 24 64 

GTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCG 2359 

I I I I I I I I M I I I I I I I I I I M I I I M I I I I M I I I 11 I I I I I I I I I I I I I I I I I I I I I I 
GTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCG 2524 

ACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGC 2419 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGC 2 584 

TGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCC 2479 

I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I M I I I I I i I I 
TGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCC 2 644 

CCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCG 2539 

I M I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M M I I 

CCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCG 27 04 
ACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCA 2599 

I I M I I I I M I I I I I I I I I I I I I I I I I I I I I 1 I M I I I I M I M I I I M I I I I I I I I M I 



AGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCA 2659 

I I I I I M M M I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

AGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCA 2824 

ACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCA 2719 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I 
ACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCA 28 84 

CAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I M M I I I I I I I I I I I I I 

CAGTGTCGGAGGCTGAGTGCTGA 2907 
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ABK15169 

ID ABK15169 standard; DNA; 3580 BP. 
XX 

AC ABK15169; 
XX 

DT 23-APR-2002 (first entry) 

XX 

DE Human REPTR 1 cDNA sequence. 
XX 

KW REPTR; human; antiinflammatory; cytostatic; immunosuppressive; antiviral; 

KW anti-HIV; antiarthritic; anticonvulsant; nootropic; neuroprotective; 

KW antiallergic; antibody; immunogen; endometriosis; 

KW gastrointestinal disorder; gastritis; oesophageal carcinoma; 

KW Crohn's disease; irritable bowel syndrome; ulcerative colitis; 

KW endocrine disorder; hypothalamus disorder; Kallman's disease; 

KW autoimmune disease; inflammatory disease; infertility; receptor; 



KW acquired iitunune deficiency syndrome; AIDS; rheumatoid arthritis; allergy; 

KW osteoarthritis; diabetes mellitus; multiple sclerosis; 

KW systemic lupus erythematosus; cell proliferative disorder; cancer; 

KW developmental disorder; Duchenne muscular dystrophy; gene; 

KW Becker muscular dystrophy; neurological disorder; epilepsy; 

KW Alzheimer's disease; Huntington's disease; reproductive disorder; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 4. .2532 

FT /*tag= a 

FT V "REPTRl protein" 

XX >^ . , 

PN WO200198354-A2:.-- • ' 
XX 

PD 27-DEC-2001. 
XX 

PF 21-JUN-2001; 2001WO-US019942 . 
XX 

PR 21-JUN-2000; 2000US-0214027P. 

PR 25-AUG-2000; 2 0OOUS-O228045P . 

PR 12-DEC-2000; 2000US-0255104P . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Griffin JA, Kallick DA, Tribouley CM, Yue H, Nguyen DB, Tang YT; 

PI Lai P, Policky JL, Azimzai Y, Lu DAM, Graul R, Yao MG, Burford N; 

PI Hafalia AJA, Baughn MR, Bandman O, Patterson C, Yang J, Xu Y; 

PI Gandhi AR, Warren BA, Ding L, Sanjanwala MS, Duggan BM, Lu Y; 
XX 

DR WPI; 2002-090432/12. 

DR P-PSDB; AAU17818. 
XX 

PT Twelve human receptors (referred to as REPTR-1 to REPTR-12), useful in 

PT the diagnosis, treatment and prevention of gastrointestinal (e.g. 

PT gastritis), autoimmune/inflammatory (e.g. osteoarthritis) and cell 

PT proliferative (e.g. cancer) disorders . 
XX 

PS Claim 57; Page 142-143; 157pp; English. 

XX 

CC This invention relates to twelve human receptors cDNA sequences referred 

CC to as REPTR-1 to REPTR-12), and the proteins encoded thereby. The 

CC proteins of the invention may have antiinflammatory, cytostatic, 

CC immunosuppressive, antiviral, anti-HIV, antiarthritic, muscular active 

CC general, anticonvulsant, nootropic, neuroprotective, antiallergic 

CC activities. The sequences of the invention may be used to produce REPTR 

CC agonists or antagonists, and the protein sequences may be used to raise 

CC anti-REPTR antibodies. These molecules and the REPTR polynucleotides and 

CC polypeptides of the invention are useful in the diagnosis , treatment and 

CC prevention of gastrointestinal (e.g. gastritis, oesophageal carcinoma, 

CC Crohn's disease, irritable bowel syndrome, ulcerative colitis), endocrine 

CC (e.g. hypothalamus disorder, Kallman's disease), autoimmune/ inflammatory 

CC (e.g. acquired immune deficiency syndrome (AIDS), rheumatoid arthritis, 

CC allergies, osteoarthritis, diabetes mellitus, multiple sclerosis, 

CC systemic lupus erythematosus), cell proliferative (e.g. cancer), 

CC developmental (e.g. Duchenne and Becker muscular dystrophy), neurological 



CC (e.g. epilepsy, Alzheimer's disease, Huntington's disease) and 

CC reproductive (e.g. infertility, endometriosis) disorders. Numerous other 

CC examples of each disorder are given in the specification. The present 

CC sequence represents the human REPTRl cDNA sequence of the invention 

XX 

SQ Sequence 3580 BP; 670 A; 1233 C; 1025 G; 652 T; 0 U; 0 Others- 
Query Match 85.6%; Score 2356; DB 6; Length 3580; 
Best Local Similarity 93.6%; Pred. No. 0; 

Matches 2537; Conservative 0; Mismatches 5; Indels 168; Gaps 1; 

GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 102 

I I I I I I I I M M M I I M I I I I I I I I M I II M I I I M I I I I I I I I I I M M I I I I M I I 

GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 60 
CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 162 

I I I I I I I I M I I II II I M II M M M I II I M II II II I II I I 1 I II II I I I I I I I I M 

CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 120 
CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 222 

M I I I II M I I I I I II I I I II I II I M I II I I M I I I I I I I I II II I I I I II I I I I I I II 

CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 18 0 

CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 282 

I I I I I I I I I I I I I I M II II II II I II I II II M I I I I I I I II I I I II I I II II I I II II 
CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 24 0 

GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 342 

I I M I I I I I I I I I II I I I I I M I II I I I I I I I I I I I M I I I I I II I I II I I I M I I II II 

GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 300 

CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 4 02 

II I II I II II I I I II I I I I I M I M I I I I I II I I I I I I I I I II I II I I I I M M I I I I I I 
CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 360 

GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 4 62 

I II II I I I I I I I I I M M II I I I I I II I II M I I I II II I I I I M I I I I I II I I I I I I I I 



Qy 


43 


Db 


1 


Qy 


103 


Db 


61 


Qy 


163 


Db 


121 


Qy 


223 


Db 


181 


Qy 


283 


Db 


241 


Qy 


343 


Db 


301 


Qy 


403 


Db 


361 


Qy 




Db 


421 


Qy 


523 


Db 


481 


Qy 


583 


Db 


541 


Qy 


643 


Db 


601 


Qy 


703 



I I I I I Ml I I I M I M I I I I I I I I 



I I I M I I M I M I I I I I I I I I I I I I I I I I I I I 



GCCTACATCCGCATAGCCTATTTGCGC; iASAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 480 



GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 582 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 54 0 
GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 642 

I I I M I I I I M I I I I I I I I I I I I I I I I I I I I N I I I I I I M I I I M I I I I I I I M I I M I 

GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 600 



I I I I I I I I I I I I I I I I I M 1 I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I 



702 

660 



ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAbCGCCTCCGCTGCTGTCATCGTC 762 

I I M I I I I I I I I I I I I I I I I I I I I I I M I I M 1 I M I I M I I M I I I I I I I M I I I I I I I 



Db 


661 


Qy 


763 


Db 


721 


Qy 


823 


Db 


727 


Qy 


883 


Db 


727 


Qy 


943 


Db 


733 


Qy 


1003 


Db 


793 


Qy 


1063 


Db 


853 


Qy 


1123 


Db 


913 


Qy 


1183 


Db 


973 


Qy 


1243 


Db 


1033 


Qy 


1303 


Db 


1093 


Qy 


1363 


Db 


1153 


Qy 


1423 


Db 


1213 


Qy 


1483 


Db 


1273 


Qy 


1543 


Db 


1333 



661 ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTC 720 

:gtgaacggtgggtggtcgacgtggaccgagtggtccgtctgcagcgccagctgtggg 822 
MM 

^.KTr; 726 



726 

TTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGC 942 

M I M I 

GACGGC 732 

AGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 1002 

I M M I M M M M M M M M M M M M M M M M M M M M M M M M M M M 

AGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 7 92 
CGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 1 062 

M M M M M M I M M M M M M I M M M M M M M I M M M M M M I M M M 

CGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 852 
GACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTG 1122 

M M M M M M M M M M M M M M M M I M M M M M M M M M I M M M I 

GACACCCGCAACTGTACCAGTGACCTCTGTGTACACACTGCTTCTGGCCCTGAGGACGTG 912 
GCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTC 1182 

M M M M M M M M M M M M M M M M I M I M M M M M M M I M M 

GCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTC 972 
ATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT 1242 

M M M M M M M M M M M M M M M M M M I M M I M M M M M M M M M 

ATCCTCGTTTATTGCCGGAAGTU^GGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT 1032 
CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTG 1302 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCT^AAGCAGACAACCCCCATCTG 1092 
CTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCC 1362 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

CTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCC 1152 
CGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTG 1422 

M M M M M M M M M M M M M M M M M M M M M M M M M M M M M M 

CGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTG 1212 
GGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTC 1482 

M M M M M M M M M I M M M M M M M M M M M M M M M I M M M M M 

GGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTC 1272 
TCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACC 1542 

M M M M M M M I M M M M M M M M M M M M M M M M M M M M M M I 

TCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACC 1332 
TATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTC 1602 

M M I M M I M M M M M M M M M M I M I M M M M M M I M I M M M M I 

TATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTT^ATACAGGAATCAGCCTC 1392 



Qy 


1603 


CTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCAC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1662 


Db 


± J J? o 


r T r a T r r r r r r A AT r* AT A r r r r AGGG AAGAT CT AT GAGAT CT ACCT C ACGCT GC AC 


1452 


QY 


1663 


AAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTT 


1722 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


J. 1 o 


AAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTT 


1512 


Qy 


1723 


AGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGT 


1782 




1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


J. O -L O 


Af:rTr;Tnf;ArrrrrTf;GrGTrrTGrTCACCCGGCCAGTCATCCTGGCTATGGACCACTGT 


1572 


Qy 


1783 


GGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGC 


1842 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 




Db 




rrrr* Arz^rr ArzrrPTnAPAnrTnt^Af^rrTnrGrrTrAAAAAfirAGTrRTfir'GAGGGCAGCl 


1632 


Qy 


1843 


TGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTG 


1902 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 




Db 


Xb Jo 




1692 


Qy 


1903 


GAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAG 


1962 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




r a r* r" r* a r'T r* P r'T r^^T A P f^T PTT r* A P r Af^P A nPT nnf^r r* fir*TT T r r T GGT GGGAGAG 


1752 


Qy 


1963 


GCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGC 


2022 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 T R 


ppppTPAf^Pr^TnnPTrJPPrJPPAAnPRPPTPAAGPTGPTTPTGTTTGCGCCGGTGGCCTGC 


1812 


Qy 


2023 


ACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAG 


2082 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




A r« r* T P P PT P r:: A riT A P A AP AT P P fifif^T P T APT GP P T GCAT GACACCCACGAT GCACT CAAG 


1872 


Qy 


2083 


GAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTG 


2142 




1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 Q'7 '5 
1 d / O 


PAPPTPPTPPAPPTnr^AflAArJPAriPTnnrif^f^RAPAClPTnATPrAGGAGr'CACGGGTCCTG 


1932 


Qy 


2143 


CACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTG 


2202 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




PAPTTPAAr:r:APA(^TTAPPAPAAPPTP;PGr'rTATrrATC'C*ACGATGTGCCCAGCTCCCTG 


1992 


Qy 


2203 


TGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGC 


2262 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




TrrtrrAAnAnTAAnPTPPTTnTPAGPTAPPAGGAGATCCCCTTTTATCACATCTGGAATGGC 


2052 


Qy 


2263 


ACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGAC 


2322 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 




Db 


^ U ^ sJ 


APf^PAr;Pfir:TAPTTGrArTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGAC 


2112 


Qy 






2382 




1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2113 


CTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAAC 


2172 


Qy 


2383 


TTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 


2442 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 




Db 


2173 


TTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 


2232 



Qy 2443 GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 2502 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I 

Db 2233 GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 22 92 

Qy 2503 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2562 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M 
Db 2293 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2352 

Qy 2563 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2622 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I ! I I I I I I I M I I 

Db 2353 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2412 

Qy 2623 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2682 

I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 2413 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2472 

Qy 2683 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I I I I I M I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2473 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2532 

Qy 2743 GGCCGGCCAG 2752 

I I I I I I I I M 
Db 2533 GGCCGGCCAG 2542 



RESULT 6 
7\AV52940 

ID AAV52940 standard; cDNA; 3014 BP. 

XX 

AC AAV52940; 
XX 

DT 25~MAR-2003 (revised) 

DT 21-DEC-1998 (first entry) 

XX 

DE Rat UNC-5 homologue unc5h-l cDNA. 
XX 

KW UNC-5; UNC5H-1; rat; netrin receptor; cell migration; axon guidance; 

KW diagnosis; therapy; ds . 

XX 

OS Rattus sp. 

XX 

FH Key Location/Qualifiers 
FT CDS 1. .2697 

FT /*tag= a 

XX 

PN WO9837085-A1. 
XX 

PD 27-AUG-1998. 
XX 

PF 19-FEB-1998; 98WO-US003143 . 
XX 

PR 19-FEB-1997; 97US-00808982 . 
XX 

PA (REGC ) UNIV CALIFORNIA. 
XX 

PI Tessier-Lavigne Leonardo ED, Hinck li, Masu M, Keinomasu K; 

XX 



DR WPI; 1998-495364/42. 

DR P-PSDB; AAW78898. 
XX 

PT Netrin-binding^ vertebrate proteins - useful for diagnosis, therapy and 

PT the biopharmaceutical industry. 

XX 

PS Claim 7; Page 15-17; 32pp; English. 

XX 

CC This cDNA, termed unc5h-l, comprises a rat homologue of Caenorhabditis 

CC elegans unc-5. Rat unc5h~l and unc5h-2 (see AAV52942) cDNAs were isolated 

CC from an E18 brain cDNA library. The predicted proteins (see AAW78898 and 

CC AAW78900) show similarity with UNC-5. They are predicted to be involved 

CC in cell migration and axon guidance, and are characterised as receptor 

CC proteins for netrins . Gene expression is observed in regions where 

CC differentiating neurons are undergoing axogogenesis . Human unc5h-l (see 

CC AAV52941) and unc5h-2 (see 7iAV52943) cDNAs are also claimed. Vertebrate 

CC UNC-5 proteins may be produced recombinantly from transfected host cells 

CC by utilising these vertebrate UNC-5 nucleic acids. The invention also 

CC provides unc-5 hybridisation probes and primers, vertebrate UNC-5- 

CC specific binding agents such as specific antibodies, and methods of 

CC making and using the subject compositions in diagnosis (e.g. genetic 

CC hybridisation screens for vertebrate unc-5 transcripts), therapy (e.g. 

CC gene therapy to modulate vertebrate unc-5 gene expression) and in the 

CC biopharmaceutical industry (e.g. as immunogens, reagents for modulating 

CC cell guidance, reagents for screening chemical libraries for lead 

CC pharmacological agents, etc.). (Updated on 25-MAR-2003 to correct PI 

CC field.) 
XX 

SQ Sequence 3014 BP; 596 A; 977 C; 849 G; 592 T; 0 U; 0 Other; 

Query Match 82.1%; Score 2259; DB 2; Length 3014; 

Best Local Similarity 89.7%; Fred. No. 0; 

Matches 2427; Conservative 0; Mismatches 280; Indels 0; Gaps 0; 

^TGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I I I M M I I I I I I I I I I I I I M M I I I I I M I I I I I I I M I I I I I I I I I I I Mill 



II II I I M I I I I I M I I I I I I II I I I I I I I II I II I I I I M I I I I I I I I I I I I 



II II I I I I II I I II I I I I M II I II I I I I I I I M I I I I I I I I I I II II I II II 



III II I I I I I I I I I I I I I I I I I II I I I II I I II I I II I I I I M II II I I I II 



II I I I I I I I II I II II I I I II M I I I I I M I III I I I I I I M III 



I I I I I I I I I II I I II II II II I I I I I I I M I I I I I I I Mill I I M M M I 



Qy 


46 


Db 


1 


Qy 


106 


Db 


61 


Qy 


166 


Db 


121 


Qy 


226 


Db 


181 


Qy 


286 


Db 


241 


Qy 


346 


Db 


301 



Qy 406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 4 65 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 361 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAA7\AGTCAGAAGGCC 420 

Qy 4 66 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

M M I I I I II III II I II I I II I II II I I I I I I I I I II I I I I I I II I I I III 
Db 421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 4 80 

Qy 526 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

II I I I I I I II I I I II II II II II II II II II II I I I I I I I I I I III 

Db 4 81 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 540 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

I I I I I I I I I M II II I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I 
Db 541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 600 

Qy 646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 705 

I II I I I II M I I I I I I I II Mill II I II I I I M I II I II I I I I I M I I I I II I I 

Db 601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 660 

Qy 706 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

II I I I I I I I I M M I I I I I II I I I I I I M III I I I I II II Mill II II 

Db 661 TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 720 

Qy 766 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

II II II II M II II I II II II II II I I I II II II II II II I M II I II II M II I II I 
Db 721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 780 

Qy 82 6 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 8 85 

II M II II I II II II I II I II II I II I II I I I II I II I I M II I II II II I M II Ml 
Db 781 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 840 

Qy 886 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

II II I II M I II I M I M II II I M I II I I II I II II I M I I II I I M I M II III 

Db 841 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 900 

Qy 94 6 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

II II I II I M I I I II I II II II M II I II II II I II I II II I II I M M II I I I 
Db 901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 960 

Qy 1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

M M M I M I II II I I M II II II M Mill M M II II I III II I M M I I M 

Db 961 GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 102 0 

Qy 1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

II M II I II M M II I II I M II M I I I II I II I II I MM II II II II II I 

Db 1021 ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 1080 

Qy 1126 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

Mill I II M I II II II II I M M I M II M II I I II I I M I 

Db 1081 CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 

Qy 1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

III II II II II II II I II I II II II I II II II II I II I II II I II II I I III 

Db 1141 CTCATTTACTGTCGCT^GAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 1200 



Qy 1246 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I 

Db 1201 ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 1260 

Qy 1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I Ml I II 

Db 1261 ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 1320 

Qy 1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

II II I I I I I I M I I II I II I I I I I I I I I I I Mill M I M II II M I M I I I I I 

Db 1321 CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 1380 

Qy 1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I II II I I II II I I I I I I I I I II I I I I I I II I I I I II I I I I I I I I I I I II I II 
Db 1381 AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 1440 

Qy I486 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

II I I I I I M I II II I I I I I I I I II I M I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 1500 

Qy 1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

II I I I I I II I II I I I II I II I I I I I II I I I II I I I II II I I I I I M I I I I I I I I I I I I 
Db 1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 1560 

Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1561 ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 1620 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I 
Db 1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

mil I I I II II M I I I I I I I I I I I I I I I I I I II I I M II I I I I I II I I I I I II 
Db 1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 174 0 

Qy 1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

I I II II I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I II I I III 
Db 1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 

Qy 1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I I I II I I I I I I I I I I II I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I II 
Db 1801 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 18 60 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

I II I I I I M II I I M I I I I I I M I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2 02 5 

I I I I I I I I I I I I II I II II II I I M I I III II I II M I I II II I II II I I I II 
Db 1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 1980 

Qy 2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

I I I I I I I I I I I I I I I M I I II I I I I I I I I II I I I I I I I I I I I II I I I I I I II I 
Db 1981 TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 

Qy 2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 



I I M I I I I I I I I I I I I I I I I I I I II I M I M I I I I I I I I I I I I I I II I M II I I II 

Db 2041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

Qy 2146 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

I I I I I I I I I M I I I I I I II I I I II II I I I I II I I I M I I I M I I I I I I I I I II II 

Db 2101 TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 2160 

Qy 2206 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

I I I I I I I I I I II I I M I I M I I I I I M I I I I I I I I I I I I I I M I II I M Mill 

Db 2161 AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 2220 

Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

I I II III II I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I II II I I I I I I 
Db 2221 CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 2280 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I M II I I I I I I I I I I I M I I I I I I I I I I 
Db 22 81 GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 2340 

Qy 2386 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

I II II I I I I I I I I I II II I I I I I I M I I II I I I I I I I I I I I I I II I I I I I I I M I 

Db 2341 AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 2400 

Qy 2446 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I II 
Db 2401 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 2460 

Qy 2506 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 2565 

II II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I II I I I 
Db 2461 GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 2520 

Qy 2566 CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2625 

I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I 
Db 2521 CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 2580 

Qy 2 626 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2 685 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 2581 AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 264 0 

Qy 2686 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

II I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 1 I I I I I I I I 

Db 2641 GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGAGAC 2700 

Qy 2746 CGGCCAG 2752 

I I I I I I 

Db 2701 CAGCCAG 2707 



RESULT 7 
AAS16843 

ID AAS16843 standard; cDNA; 2697 BP. 

XX 

AC AAS16843; 
XX 

DT 14-FEB-2002 (first entry) 
XX 

DE Rat netrin receptor UNC5H1 (YSG7) cDNA. 



KW YSG; YSG7; schizophrenia; chronic animal model; LCGU; netrin receptor; 

KW local cerebral glucose utilisation; phosphodiesterase 1-alpha; UNC5H1; 

KW calcium-independent alpha-latrotoxin receptor; CIRL; trkE; synapsin lA; 

KW epithelial discoidin domain receptor 1; synapsin IB; neuroleptic; ss; 

KW tumour necrosis factor alpha; TNF-alpha; rat. 
XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .2697 

FT /*tag= a 

FT /product= "Rat netrin receptor UNC5H1" 
XX 

PN WO20017544 0-A2. 

XX 

PD ll-OCT-2001. 
XX 

PF 02-APR-2001; 2001WO-GB0014 86 . 
XX 

PR 31-MAR-2000; 2000GB-00007880 . 

PR 26-MAY-2000; 2000GB-00012768 . 
XX 

PA (WELF-) WELFIDE CORP. 
XX 

PI Cochran S, Paterson G, Ohashi Morris B^ Pratt J; 

XX 

DR WPI; 2002-010813/01. 

DR P-PSDB; AAU10543. 
XX 

PT Novel chronic animal model of schizophrenia^ useful for identifying anti- 

PT psychotic drugs and genes that are associated with schizophrenia. 

XX 

PS Claim 1; Fig 8a; 79pp; English. 
XX 

CC The invention relates to YSG polynucleotide fragments for use in 

CC diagnosing and/or developing treatments for schizophrenia using chronic 

CC animal models . The polynucleotides and their encoded polypeptides are 

CC used for identification of compounds which modulate the expression of YSG 

CC molecules, leading to the manufacture of schizophrenia medicaments. The 

CC sequences can also be used for testing candidate compounds for any effect 

CC on the polypeptides. Anti-schizophrenic effects of a compound can be 

CC determined by measuring local cerebral glucose utilisation (LCGU) or 

CC comparing its expression level with that of a control group. The 

CC sequences are useful in the identification of genes associated with 

CC schizophrenic states and in the development of an antibody. The sequences 

CC of the invention include phosphodiesterase 1-alpha, calcium-independent 

CC alpha-latrotoxin receptors (CIRL) -1, 2&3 , epithelial discoidin domain 

CC receptor 1 (trkE), netrin receptor (UNC5H1) , synapsins lA and AB and 

CC tumour necrosis factor (TNF) alpha. This sequence represents rat netrin 

CC receptor UNC5H1 (YSG7) DNA 

XX 

SQ Sequence 2697 BP; 541 A; 864 C; 766 G; 526 T; 0 U; 0 Other; 



Query Match 81.8%; Score 2252.2; DB 6; Length 2697; 

Best Local Similarity 89.7%; Pred. No. 0; 

Matches 2419; Conservative 0; Mismatches 278; Indels 0; Gaps 



0 



Qy 46 ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 

Qy 106 CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

II II I II II I II II I II I I II I II I II I I I I II I I I I I I II I I M M I I I I I II 

Db 61 CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 120 

Qy 166 GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

I 11 I I M I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I II I I I I I 
Db 121 GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 180 

Qy 22 6 GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 2 85 

III II I I I M I I I I I I I I I I I I Mill I M I I I I I I I I I II I I I I I I I I I I I 

Db 181 GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 240 

Qy 286 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

I I I I I I I I I I I I I II I I I II II II I M I I I M Ml I I M I I II MM 

Db 241 TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 300 

Qy 34 6 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 4 05 

I I I I I I M II II II II M M II I II II II II II Mill I 11 M I II I 1 I II I 

Db 301 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 360 

Qy 406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 465 

II M II II II II I II II 1 II II M I II I M II I M M II I II I M I I II II I II I II 

Db 361 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 420 

Qy 466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

I I I M M I II 111 II II I II II I II I I II I 1 II M I II I I II II I II I I III 

Db 421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 480 

Qy 526 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

M II M I II I Mill M II II M II II II II II I I I II Mill III 

Db 481 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 54 0 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

M II II M II I II II II II I II I I M II M I II II I M II II I II I 1 II I II 

Db 541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 600 

Qy 64 6 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCT^ACTACACC 705 

II II II I II II II II II II Mill I II I II II I II II II II II M II II I II 1 II 

Db 601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 660 

Qy 706 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 7 65 

11 II II II I II II II II II II II M 11 I I II M M II II I II II 

Db 661 TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 720 

Qy 766 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

II II I II M M II M II M II II M I I I I I 1 II II II II II M 11 M M II II II I II 

Db 721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 780 

Qy 826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 885 

II II II I 1 II II I 1 II II II II I M M II II II II I II II I II I M II II II M I 11 I 
Db 781 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 840 



Qy 


886 


TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 

M M M 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II II III 

TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 


945 


Db 


841 


900 


Qy 


946 


TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 
1 1 1 1 1 1 II M II 1 II 1 II 1 1 II 1 II 1 II 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 


1005 


Db 


901 


960 


Qy 


1006 


GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 

1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 M 1 M 

GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 


1065 


Db 


961 


1020 


Qy 


1066 


ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 

M 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II M II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 


1125 


Db 


1021 


1080 


Qy 


1126 


CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 
Mill 1 II II 1 M 1 1 M M 1 1 1 1 1 II 1 M II 1 II II II 1 II 1 
CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 


1185 


Db 


1081 


1140 


Qy 


1186 


CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 
Ml II 1 1 II II 1 II 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 II 1 II 1 II II II II 1 1 III 
CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 


1245 


Db 


1141 


1200 


Qy 


1246 


ACCTCAGGCTTCCAGCCCGTCAGCATCT^AGCCCAGCAAAGCAGACAACCCCCATCTGCTC 

Mill M II II II II 1 II 1 1 II 1 II II M M M M II II 1 II II II 1 1 1 M II M II 

ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 


1305 


Db 


1201 


1260 


Qy 


1306 


ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 

M M II 1 1 II 1 1 1 M II 1 II II 11 II II II II 1 II M 1 1 M II II M 1 III 1 II 

ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 


1365 


Db 


1261 


1320 


Qy 


1366 


CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 

1 M II II 1 1 II II 1 II II 1 1 II II II II II 1 1 M II M 1 1 II 1 

CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 


1425 


Db 


1321 


1380 


Qy 


1426 


GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 

1 1 II M 1 1 1 II M 1 II 1 1 II II II II M II 1 1 1 II 1 M Mill 1 1 i 1 II 1 II 

AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 


1485 


Db 


1381 


1440 


Qy 


1486 


CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 
M II II II 1 M II 1 II 1 1 1 1 1 1 II II II 1 II II II 1 1 II 1 II 1 II II 1 1 1 1 1 1 1 
CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 


1545 


Db 


1441 


1500 


Qy 


1546 


GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 

M M M M M 1 1 M II 1 M 1 1 II II II II II 1 M II 1 1 M II 1 1 M 1 1 1 1 1 1 1 1 M II 

GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 


1605 


Db 


1501 


1560 


Qy 


1606 


ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 

M Ml Mill llllllll IIMIIII llllllllllllll IMIMIII 

ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 


1665 


Db 


1561 


1620 


Qy 


1666 


CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 

II M 1 II 1 M 1 1 1 II 1 1 1 1 1 II II 1 II 1 II 1 11 1 II 1 II II II II 1 M 1 1 II 1 M II 

CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 


1725 


Db 


1621 


1680 


Qy 


1726 


TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 


1785 



Db 


1681 


TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 


1740 


Qy 


1786 


GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAA7VAAGCAGTCGTGCGAGGGCAGCTGG 

1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 III 

GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 


1845 


Db 


1741 


1800 


Qy 


1846 


GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 
1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 
GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 


1905 


Db 


1801 


1860 


Qy 


1906 


GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 

III 1 M 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 II 1 1 1 1 1 M 1 1 

GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 


1965 


Db 


1861 


1920 


Qy 


1966 


CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 
1 1 1 II 1 M 1 1 1 1 II 1 M 1 M 1 1 1 1 1 1 1 III 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 11 
CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 


2025 


Db 


1921 


1980 


Qy 


2026 


TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 
TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 


2085 


Db 


1981 


2040 


Qy 


2086 


GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 
GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 


2145 


Db 


2041 


2100 


Qy 


2146 


TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 


2205 


Db 


2101 


2160 


Qy 


2206 


AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 M 1 1 1 II 1 1 1 1 1 II 
TyVGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 


2265 


Db 


2161 


2220 


Qy 


2266 


CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 

1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 II 1 M 1 1 1 1 1 1 

CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 


2325 


Db 


2221 


2280 


Qy 


2326 


GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 
1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 
GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCTs^CTTC 


2385 


Db 


2281 


2340 


Qy 

Db 


2386 
2341 


AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 

1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 


2445 
2400 


Qy 


2446 


CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 


2505 


Qy 


2506 


TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 

MM 1 1 II II II 1 1 II II II II 1 II 1 II 1 II II 1 1 1 II II 1 1 II 1 1 1 II 1 1 
GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 


2565 


Db 


2461 


2520 


Qy 


2566 


CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 


2625 



I I M I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 



Db 


2521 


CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 258 0 


Qy 


2626 


AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2685 

INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
Ill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1(1 

AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 264 0 


Db 


2581 


Qy 


2686 


GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 
II II 1 1 1 1 1 II II II 1 II 1 II II II 1 II II 1 1 1 1 II II II II 1 Mill III 
GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGA 2697 


Db 


2641 



RESULT 8 
AAV52 941 

ID AAV52 941 standard; cDNA; 1787 BP. 
XX 

AC AAV52 941; 
XX 

DT 25-MAR-2003 (revised) 

DT 21-DEC-1998 (first entry) 

XX 

DE Human UNC-5 homologue unc5h-l cDNA. 
XX 

KW UNC-5; UNC5H-1; human; netrin receptor; cell migration; axon guidance; 

KW diagnosis; therapy; ds . 

XX 

OS Homo sapiens - 



XX 

FH Key Location/Qualifiers 

FT CDS 3. .1670 

FT /*tag= a 

FT /transl_except= (pos:21. .22, aatXaa) 

FT /transl__except= (pos:200. .203, aarlle) 

FT /transl_except= (pes: 771. .772, aaiSer) 

FT /transl_except= (pos:785. .786, aa:Leu) 

FT /transl_except= (pos:1078. . 1079, aa:Xaa) 

FT X \/transl_except= (pos:1098. .1099, aa:Xaa) 

/transl_except= (pos:1106. .1107, aarXaa) 

Ff /transl_except= (pos:1621. .1622, aa:Gly) 

FT ^note= "these codons have apparent 1 nucleotide deletions 

FT or insertions, which alter the reading frame" 

XX / 



PN WO9837 085-A1. / 
XX 

>G.^2 7 -AUG- 1 9 98 . 
XX^ 

PF 19-FEB-1998; 98WO-US003143 . 
XX 

PR 19-FEB-1997; 97US-00808982 . 
XX 

PA (REGC ) UNIV CALIFORNIA. 
XX 

PI Tessier-Lavigne M, Leonardo ED, Hinck L, Masu M, Keinomasu K; 
XX 

DR WPI; 1998-495364/42. 

DR P-PSDB; AAW78899. 
XX 

PT Netrin-binding, vertebrate proteins - useful for diagnosis, therapy and 



PT the biopharmaceutical industry. 



XX 



PS Claim 7; Page 17; 32pp; English. 

S This partial cDNA, termed unc5h-l, comprises a human homologue of 

CC Ca:no?habdiiifS;gans unc-5. unc5h-l and unc5h-2 (see AAV52943) cDNAs 

CC were isolated from an embryonic brain cDNA library ^he predxcted 

proteins (see AAW78899 and AAW78901) show similarity with UNC-5 They are 
predicted to be involved in cell migration and axon guidance, and are 
Characterised as receptor proteins for netrins. Gene -PJ--^°;; 

CC observed in regions where differentiating ^^^^^^^.^^^^^^f f?;^"^42) cDNAs 
CC axoqoqenesis. Rat unc5h-l (see AAV52940) and unc5h-2 (see AAV52942) cDNAs 
^r^a?;" claimed. Vertebrate UNC-5 proteins may ^^^^^f/^ 
from transfected host cells by utilising these vertebrate UNC-5 nucleic 
acids. The invention also provides unc-5 hybridisation probes and 



CC 
CC 
CC 



CC 
CC 

CC acias . xne inveiiuiun a-i..=^ ^^^-^ ^ - .c^ „ 

CC primers, vertebrate UNC-5-specif ic binding agents such as specific 

CC antibodies, and methods of making and using_the^sub,ect^compositions in 

CC 
CC 
CC 



anriDoaicii, ain^ m^^ix^^^ ^-^ ^ - c 

diaanosis (e.g. genetic hybridisation screens for vertebrate unc-5 
tr^rcri^ts), 'therapy (e.g. gene therapy to modulate vertebrate unc-5 
aene expression) and in the biopharmaceutical industry (e.g. as 
CC Imogens, reagents for modulating cell guidance, -Jf f ^ .^^ ^^^^^^^f 
CC chemical libraries for lead pharmacological agents, etc.). (Updated on 25 
CC -MAR-2003 to correct PI field.) ^ 
XX '-^ 
SQ 



sequence 1787 BP; 349 A; 603 C; 491 G; 344 T; 0 U; 0 Other; y^r^^^ 

Query Match 56.8%; Score 1562 4; DB 2; Length 1787; ^^"^^ 

Best Local Similarity 98.5%; Pred. No. 2.1e-292; Gaps^8; 
Matches 1661; Conservative 0; Mismatches 16; Indels ,^^9, ^^Gaps 

ICAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGG^C^TGbcCCTCT 1129 

,,, I, 1,1,11111111111 I II I I II I lllllll II 111 Mil III 111111111! 
.ciiiiGTAciAOTiAicTCTG-GTACACACTGCTTCTGC^CCgGAGG^^ 59 



Qy 


1070 < 


Db 


1 


Qy 


1130 . 


Db 


60 


Qy 


1190 


Db 


120 


Qy 


1250 


Db 


180 


Qy 


1309 


Db 


240 


Qy 


1369 


Db 


300 


Qy 


1429 


Db 


360 



119 
1249 



ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCUTbL,! i ii v^^f t TTT T 7 

,,,,|,,,l||||lllllllllllllllllilllllll.'lll'llllllMll MM 

iTGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCC^^^ 

TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 

, II I I ,,, I , I ,,,, I I I I I I I I I I I I I II 1 11 I I I I M M M I I I I I I II 1 II II I M I 

UiiUGCCGGiAGJJ^GGiGiGGCTGGACTCAGATGTGGCTGACTCGTCCATTC^ 17 9 

CAGGCTTCCAGCCCGTCAGCATC-AAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 1308 

M I I I I I I I II I I II I 1 I II 1 I I I I I II I I 1 M 1 I I I II M 11 11 I II II II II M 11 I 
iiGGiii(l:iiicciGi(!J.iiAiiTJJvG^^ "9 

ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 1368 

I 1 1 II I I I II 1 11 M I I I 1 II I I M I M 11 I 11 11 1 1 11 II M II I M 1 I II 1 I I I 11 M 

iiicAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 299 
GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 1428 

1 1 I II I I I II 1 1 1 I I I I I I M I II 1 1 I II 1 I 1 II MM 1 II I I II 11 11 II II 1 1 M M I 

GiiGGGCCC^GCCCcilGTTCCAGCTCAC^^ 359 
GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 1488 

IIMMMIMIMMMMMIMMIMIMMIIMIMIMIIMIMMMMM 

GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 4 1 9 



1489 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 1548 
I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I M I M I I I I I I I I M 
420 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 479 

1549 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATC 1608 

I I I I II II II I II I I I II II II I I II I I II II I M I I I M I I I I I M I I II I I I I I I I I 
48 0 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATC 539 

1609 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 1668 
M I II M I I I I I II I I II I I I I I I I M I I I I I I II I M I I II I I I I I I I II I II II I I I I 
540 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 599 

1669 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 1728 

I I I II II M II II I I I I M II II I M I I I I I I I I I M II II II M I I I I I I I I I I I I I I I 
600 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 659 

1729 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 178 8 

I I I I I M I I II I I I I I I I I I I I I I I I I I I I M I I II I M I I I I I I M I I I I M I I I I I I I 

660 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 719 

1789 CCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAG 1848 

I I I I I I I I II M I I I I I I I I I I I I M I M I I I I I II I I I M I I I I II I I I II I II II 
720 CCCAGCCCTGACAGCTGGAGCCTGGCCCTCAAAAAGCAGTCGTGCGAGGG-AGCTGGGAG 77 8 

1849 GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 1908 
I I II I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
779 GATGT-CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 837 

1909 AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 1968 
I I I I II II M I I I I I II I I II I I II II I I II I I I I I I I I I I I I I I II M I I I II I I I I I I 
838 AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 8 97 

1969 AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 2028 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I 
898 AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 957 

2029 CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 2 08 8 

I I I I I M I I II I I I I I I I I I I I I I I I II I I I I I M I I I I I II I I I I I I II I I I I I I I I II 

958 CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 1017 

208 9 GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 214 8 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I 
1018 GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTT- 1076 

2149 AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 2208 

I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I M I I I I I M I I I I I I I I I I 

1077 AAGGACAGTTACCACAACCT—GCCCTATCATCCACGATGTGCCCAGCTCCCTGTGGAAG 1134 

2209 AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 2268 

I I I I I I II I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I II 
1135 AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 1194 

2269 CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 232 8 

II I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I II II I I I M I I I I I 
1195 CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 1254 



Qy 2329 TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 2388 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M M I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1255 TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 1314 

Qy 2389 ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 2448 

I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 
Db 1315 ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 1374 

Qy 2449 GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 2508 

I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1375 GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 1434 

Qy 2509 AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 2568 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1435 AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 1494 

Qy 2569 CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 2628 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I 
Db 1495 CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 1554 

Qy 2629 CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 2688 

I I I I I I I I M I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1555 CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 1614 

Qy 2689 GGACTGGGCCAGCCAGACGCTGGCCTC-TTCACAGTG-TCGGAGGCTGAGTGCTGAGGCC 2746 

II I I I I I I I I I I I I I I I II II II II I II I II I I I I I I I M I I I II II I I I 

Db 1615 GGGACTGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCC 1674 

Qy 2747 GGCCAG 2752 

II II II 

Db 1675 GGCCAG 1680 



RESULT 9 
AAC90958 

ID AAC90958 standard; cDNA; 9700 BP. 
XX 

AC AAC90958; 
XX 

DT 19-MAR-2001 (first entry) 
XX 

DE Plasmid pGC1037 nucleotide sequence SEQ ID NO: 91. 
XX 

KW Human; Caenorhabditis elegans; UNC-5; splice variant; nematode worm; 
KW protein-protein interaction; identification; ss. 
XX 
OS 

XX 



Synthetic. 
PN WO200073328-A2. ^/ ' 

07-DEC-2000. 



x: 




X 

PF 02-JUN-2000; 2000WO-EP005108 . 
XX 

PR Ol-JUN-1999; 99GB-00012755 . 
XX 

PA (DEVG-) DEVGEN NV. 



XX 

PI Van Criekinge W, Roelens I, Bogaert Verwaerde P; 

XX 

DR WPI; 2001-016508/02. 
XX 

PT Three variants of human unc-5C cDNAs (unc-5Cb, unc-5Cc and unc-5C8) and a 

PT human unc-5HSl cDNA, useful in yeast two hybrid experiments for 

PT identifying unknown human cDNAs which encode proteins that interact with 

PT the human unc-5C protein. 

XX 

PS Example 4; Page 228-230; 246pp; English. 
XX 

CC The present invention describes 3 variants of human unc-5C cDNAs (unc- 

CC 5Cb^ unc-5Cc and unc-5C8) which correspond to alternatively spliced unc- 

CC 5C transcripts, and a human unc-5HSl cDNA which shares homology with the 

CC Rattus norvegicus unc-5HSl cDNA. Also described are assays based on 

CC protein-protein-interactions between the unc-5 protein and a variety of 

CC different interacting proteins. The unc-5C variant cDNAs and unc-5HSl 

CC cDNA are useful in methods for identifying compounds which reduce or 

CC inhibit the lethal phenotype associated with the expression of the unc-5 

CC death domain in yeast. They are also useful in yeast two hybrid 

CC experiments for identifying unknown human cDNAs which encode proteins 

CC that interact with the human unc-5C protein. AAC90914 to AAC90971 and 

CC AAB50646 to AAB50693 represent sequences used in the exemplification of 

CC the present invention 

XX 

SQ Sequence 9700 BP; 2604 A; 2193 C; 2236 G; 2666 T; 0 U; 1 Other; 

Query Match 47.4%; Score 1304.2; DB 4; Length 9700; 

Best Local Similarity 99.8%; Pred. No. 2.1e-242; 

Matches 1306; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 



Qy 


1437 


CACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCAC 


1496 




M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


7 


CACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCAC 


66 


Qy 


1497 


CCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAA 


1556 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


67 


CCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAA 


126 


Qy 


1557 


CTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGA 


1616 




1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


127 


CTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATCCCCCCAGA 


186 


Qy 


1617 


TGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGT 


1676 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


187 


TGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGT 


246 


Qy 


1677 


GAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCC 


1736 




1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 M 1 




Db 


247 


GAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCC 


306 


Qy 


1737 


TGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCC 


1796 




M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


307 


TGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCC 


366 


Qy 


1797 


TGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCT 


1856 



Db 


367 


Qy 


1857 


Db 


427 


Qy 


1917 


Db 


487 


Qy 


1977 


Db 


547 


Qy 


2037 


Db 


607 


Qy 


2097 


Db 


667 


Qy 


2157 


Db 


727 


Qy 


2217 


Db 


787 


Qy 


2277 


Db 


847 


Qy 


2337 


Db 


907 


Qy 


2397 


Db 


967 


Qy 


2457 


Db 


1027 


Qy 


2517 


UD 


1 OR T 


Qy 


2577 


Db 


1147 


Qy 


2637 



MIIIIIMIMIIIIIIMIIIIIIIIMIMIIIIIIIIIIIIIIIIIMIIIIIIM 

TGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCT 



426 



GCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTG 1916 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I M M I M I I I I I 

GCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTG 486 

CTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGC 1976 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I 
CTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGC 546 

TGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTA 2036 

I M M I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I M I I I I I M I 

TGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTA 606 

CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCT 2096 

I I I I I I I I I I M I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I 1 I I ! I I I I I I I I I I I I 
CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCT 666 



GGAGAAGCAGCT GGGGGGACAGCT GAT CCAGGAGCCACGGGT CCTGCACTT CAAGGACAG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I M I I I M I I I I 

GGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAG 



2156 



726 



TTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGT^GAGTAAGCT 2216 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I M I I I 
TTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCT 7 8 6 

CCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTT 2276 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I 

CCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTT 846 

GCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCT 2336 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I 
GCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCT 906 

GTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAA 2396 

I I I M I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAA 966 

GGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGT 2456 

I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I M I I I I I 
GGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCAAGCCCTGGT 102 6 

GGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGA 2516 

I I I M I I I I I I I I I I M I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

GGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGA 108 6 

CCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAG 2576 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
CCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAG 114 6 

CCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGA 2636 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M M I I I I I I I I I I I I I I I I I 

CCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGA 1206 

GGCGCGGCACTTCCCC7\ACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGG 2696 
I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I 



Db 12 07 GGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGG 1266 

Qy 2697 CCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I 
Db 1267 CCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGTC 1315 



RESULT 10 
AAH99530 

ID AAH99530 standard; cDNA; 1321 BP. 
XX 

AC AAH99530; 
XX 

DT 16-OCT-2001 (first entry) 
XX 

DE Huinan protein encoding cDNA sequence SEQ ID NO: 3 65. 
XX 

KW Huinan; cancer; ulcer; HIV infection; human immunodeficiency virus; 

KW antiinflammatory; antirheumatic; antiarthritic; immunosuppressive; 

KW antibacterial; endocrine; cardiant; central nervous system; virucide; 

KW anti-HIV; fungicide; antimutagen; cardiovascular; antianaemic; anaemia; 

KW antiaggregant; haemostatic; vulnerary; antiulcer; osteopathic; eczema; 

KW dermatological; antiallergic; antiasthmatic; antidiabetic; cytostatic; 

KW neuroprotective; antidepressant; nootropic; antiparkinsonian; infection; 

KW immunostimulant; gene therapy; antisense therapy; vaccine; inflammation; 

KW antianaphylactic; rheumatoid arthritis; septic shock; pancreatitis; 

KW cardiac dysfunction; neuropathology; cardiac anaphylaxis; autoimmunity; 

KW genetic disease; haematopoietic disorder; platelet disorder; asthma; 

KW thrombocytopaenia; osteoporosis; severe combined immunodeficiency; 

KW allergic rhinitis; diabetes; multiple sclerosis; depression; 

KW Alzheimer's disease; Parkinson's disease; neurodegenerative disorder; 

KW neurological disorder; ss. 

XX 

OS Homo sapiens. 
XX 

FN WO200153455-A2. 
XX 

PD 26-JUL-2001. 
XX 

PF 22-DEC-2000; 2000WO-US035017 . 
XX 

PR 23-DEC-1999; 99US-00471275 . 

PR 21-JAN-2000; 2 OOOUS-00488725 . 

PR 25-APR-2000; 2 OOOUS-00552317 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT^ Liu C, Drmanac RT; 
XX 

DR WPI; 2001-457603/49. 

DR P-PSDB; AAM25589. 

XX 

PT Isolated human polynucleotides encoding polypeptides, useful for the 

PT treatment and diagnosis of e.g. cancer, ulcers and HIV infection. 
XX 

PS Claim 1; Page 471; 1217pp; English. 
XX 



CC AAH99166 to AAH99904 encode the human proteins given in 7\AM25225 to 

CC AAM25963. The proteins can have activities based on the tissues and cells 

CC they are expressed in, such as: antiinflammatory; antirheumatic; 

CC antiarthritic; immunosuppressive; antibacterial; endocrine; cardiant; 

CC central nervous system; virucide; anti-HIV; fungicide; antimutagen; 

CC cardiovascular; antianaemic; antiaggregant ; haemostatic; vulnerary; 

CC antiulcer; osteopathic; dermatological ; antiallergic; antiasthmatic; 

CC antidiabetic; cytostatic; neuroprotective; antidepressant; nootropic; 

CC antiparkinsonian; and immunostimulant . The proteins and polynucleotides 

CC encoding them can be used in gene therapy, antisense therapy and vaccine 

CC production. The proteins and polynucleotides are useful for screening for 

CC agonists or antagonists of a protein and for the treatment and diagnosis 

CC of disorders associated with the activity of a protein e.g. inflammation, 

CC rheumatoid arthritis, septic shock, pancreatitis, cardiac dysfunction, 

CC neuropathology, cardiac anaphylaxis, viral, bacterial, HIV and fungal 

CC infections, autoimmunity, genetic diseases, haematopoietic disorders, 

CC anaemia, platelet disorders, thrombocytopaenia, wounds, burns, ulcers, 

CC osteoporosis, severe combined immunodeficiency, eczema, allergic 

CC rhinitis, asthma, diabetes, cancer, multiple sclerosis, depression, 

CC Alzheimer's disease, Parkinson's disease, neurodegenerative and 

CC neurological disorders 

XX 

SQ Sequence 1321 BP; 258 A; 440 C; 371 G; 252 T; 0 U; 0 Other; 

Query Match 43.8%; Score 1206.6; DB 4; Length 1321; 

Best Local Similarity 98.0%; Pred. No. l.le-223; 

Matches 1295; Conservative 0; Mismatches 19; Indels 7; Gaps 7; 

CACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCC 1494 
I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I 



M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I 



AACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCA 1614 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

AACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATCCCCCCA 180 

GATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGAC 1674 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

GATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGAC 240 

GTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCC 1734 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I M I I I I I I I I I I I 

GTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCC 300 

CCT-GGCGTCCTGCTCACCCGGCCAGTCATCCT-GGCTATGGACCACTGT-GGGGAGCCC 1791 

III I I II I II M II II M I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I 

CCTGGGCGTCCTGCTTACCCGGCCAGTCATCCTGGGGTATGGACCACTGTGGGGGAGCCC 360 

AGCCCTGACAGCT-GGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGA 1850 

I I I M I M I I I I I I II I I I I II M II I M I II I M I I I I M I I I I I I I I M I I I I I I I I 

AGCCCTGACAGCTGGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGA 420 



Qy 


1435 


Db 


1 


Qy 


1495 


Db 


61 


Qy 


1555 


Db 


121 


Qy 


1615 


Db 


181 


Qy 


1675 


Db 


241 


Qy 


1735 


Db 


301 


Qy 


1792 


Db 


361 



Qy 



1851 TGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAG 1910 



Db 


421 


Qy 


1911 


Db 


481 


Qy 


1971 


Db 


541 


Qy 


2031 


Db 


601 


Qy 


2091 


Db 


661 


Qy 


2151 


Db 


721 


Qy 


2211 


Db 


781 


Qy 


2271 


Db 


841 


Qy 


2331 


Db 


901 


Qy 


2391 


Db 


961 


Qy 


2451 


Db 


1021 


Qy 


2511 


Db 


1081 


Qy 


2571 


Db 


T T ^ T 
1141 


Qy 


2631 


Db 


1201 


Qy 


2690 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M M I I I I I I I M I I I I I 



TGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAG 1970 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGCCTGCTACGTCTTCACCGAGCAGCTGAGCCGCTATGCCCTGGTGGGAGAGGCCCTCAG 54 0 

CGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCT 2030 

I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I 1 I I I I 1 I I I I I I I I I I 
CGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCT 600 

CGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGT 2090 

I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I M I I I I I I I 

CGAGTACAACATACTGGTCTACTGCCTGCATGACACTCACGATGCACTCAACGTAGTGGT 660 

GCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAA 2150 

I I I I I I I I I I I I I I I I I M I I 1 I I I M I I I I I I I I I I I I I M III M II I I I II II 
GCAGCTGGAGAAGCAGCTGCAGGGACAGCTGATCCAGGAGCCACTGGTACTGCACTTCAA 720 

GGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAG 2210 

I I I I I I I II I I II I I I I I I I I I I I I I II I M I I I II I I I I I I I I I I M I I I I I II I I II I 

GGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGi^AGAG 7 80 

TAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCG 2270 

II II I I I I II I I II I M I I I I II I II I I I I I II I I I II I I I I I I I I I I II I I I I I I I I 1 I 
TAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCG 84 0 

GTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTG 2330 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I M I I I I II I I I I I I I II I I I I M 

GTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTG 900 

CAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACAT 2390 
I II I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 1 I I I I I M I I I I I I I I I M I I 
CAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACAT 960 

CACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGC 2450 

I I I I I I I I I I M I I I I I M I I I I I I I I I I M I I I I I I M I I II I I I I I M I I II I I I I I I 

CACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGC 1020 

CCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAG 2510 

I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II II I 
CCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAG 1080 

CCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCT 257 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCT 1140 

GGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCT 2630 

I I I M I I I I I I M I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCT 1200 

GTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT-G 2689 

II II I I I M I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I 

GTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGG 1260 



I I I I I I I I I I I I INI I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1261 GACTGGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCCG 1320 



Qy 2748 G 2748 
1 

Db 1321 G 1321 

RESULT 11 
AAS75738 

ID AAS75738 standard; cDNA; 3646 BP. 
XX 

AC AAS75738; 

XX 

DT 13-FEB-2002 (first entry) 
XX 

DE DNA encoding novel human diagnostic protein #11542. 
XX 

KW Human; chromosome mapping; gene mapping; gene therapy; forensic; 

KW food supplement; medical imaging; diagnostic; genetic disorder; ss. 
XX 

OS Homo sapiens . 
XX 

PN WO200175067-A2. 

XX 

PD ll-OCT-2001. 
XX 

PF 30-MAR-2001; 2001WO-US008631 . 
XX 

PR 31-MAR-2000; 2000US-00540217 . 

PR 23-AUG-2000; 2000US-00649167 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Drmanac RT, Liu C, Tang YT; 
XX 

DR WPI; 2001-639362/73. 

DR P-PSDB; ABG11551. 
XX 

PT New isolated polynucleotide and encoded polypeptides, useful in 

PT diagnostics, forensics, gene mapping, identification of mutations 

PT responsible for genetic disorders or other traits and to assess 

PT biodiversity. 
XX 

PS Claim 1; SEQ ID NO 11542; 103pp; English. 
XX 

CC The invention relates to isolated polynucleotide (I) and polypeptide (II) 

CC sequences. (I) is useful as hybridisation probes, polymerase chain 

CC reaction (PGR) primers, oligomers, and for chromosome and gene mapping, 

CC and in recombinant production of (II) . The polynucleotides are also used 

CC in diagnostics as expressed sequence tags for identifying expressed 

CC genes. (I) is useful in gene therapy techniques to restore normal 

CC activity of (II) or to treat disease states involving (II) . (11) is 

CC useful for generating antibodies against it, detecting or quantitating a 

CC polypeptide in tissue, as molecular weight markers and as a food 

CC supplement. (II) and its binding partners are useful in medical imaging 

CC of sites expressing (II). (I) and (II) are useful for treating disorders 

CC involving aberrant protein expression or biological activity. The 



CC polypeptide and polynucleotide sequences have applications in 

CC diagnostics, forensics, gene mapping, identification of mutations 

CC responsible for genetic disorders or other traits to assess biodiversity 

CC and to produce other types of data and products dependent on DNA and 

CC amino acid sequences. AAS64197-AAS94564 represent novel human diagnostic 

CC coding sequences of the invention. Note: The sequence data for this 

CC patent did not appear in the printed specification, but was obtained in 

CC electronic format directly from WIPO at 

CC ftp . wipo . int/pub/published_pct_sequences 

XX 

SQ Sequence 3646 BP; 930 A; 917 C; 921 G; 877 T; 0 U; 1 Other; 

Query Match 35.2%; Score 970; DB 5; Length 3646; 

Best Local Similarity 61.3%; Pred. No. 6.6e-178; 

Matches 1680; Conservative 1; Mismatches 996; Indels 63; Gaps 5; 

Qy 57 GCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGG 116 

I I I I II I I II II II II II I I I I I III I 

Db 220 GCTCGTGCTACCTGCCCTGGCCCTGCTCAGCGCCAGCGGCACTGGCTCCGCCGCCCAAGA 279 

Qy 117 TGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCC 176 

II I I I I I I I I II I I I I I I I I I I I 

Db 280 TGATGACTTTTTTCATGAACTCCCAGAAACTTTTCCTTCTGATCCACCTGAGCCTCTGCC 339 

Qy 177 CCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGT 236 

II II II I I I I I II II I I I I I II I I II I II I II I I I I I II 
Db 340 ACATTTCCTTATTGAGCCTGAAGAAGCTTATATTGTGAAGAATAAGCCTGTGAACCTGTA 399 

Qy 237 GTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCA 296 

I I I I I I I I I I II I I II M I I I I I I I I I I I I I I I I II I I I II 

Db 400 CTGTTiAAGCAAGCCCTGCCACCCAGATCTATTTCAAGTGTAATAGTGAATGGGTTCATCA 459 

Qy 297 GGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGT 356 

I I I II II I I III II I I I I I MM I I M M 

Db 460 GAAGGACCACATAGTAGATGAAAGAGTAGATGAAACTTCCGGTCTCATTGTCCGGGAAGT 519 

Qy 357 CCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTG 416 

I I M I I III I I II I 1 I I I I I I I I I I I M I I I I I I I II I 
Db 520 GAGCATTGAGATTTCGCGCCAGCAAGTGGAAGAACTCTTTGGACCTGAAGATTACTGGTG 579 

Qy 417 CCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCAT 476 

I II I II I II I I I I I I I II I I II I I I I I II I I I II I M I I II II I I I I I I 

Db 580 CCAGTGTGTGGCCTGGAGCTCCGCGGGTACCACAAAGAGCCGGAAGGCGTATGTGCGCAT 639 

Qy 477 AGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCA 536 

II I I I I I I I I I M I I I I I I II I I I II I II I I I I I I I I I I I 

Db 640 TGCATATCTACGGAAGACATTTGAGCAGGAACCCCTAGGAAAGGAAGTGTCTTTGGAACA 699 

Qy 537 GGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCT 596 

II I I I I I I M I I I I I I I I I I I I I I I I II M II I I I I I I I I 

Db 700 GGAAGTCTTACTCCAGTGTCGACCACCTGAAGGGATCCCAGTGGCTGAGGTGGAATGGTT 759 

Qy 597 CCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCA 656 

II I I I I I I I I I I I III I I I I II I I I I 11 I I 

Db 760 GAAAAATGAAGACATAATTGATCCCGTTGAAGATCGGAATTTTTATATTACTATTGATCA 819 



Qy 



657 CAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAA 716 



Db 


820 


II III 1 1 MMIMI 1 II II IIMIIM 

CAACCTCATCATAAAGCAGGCCCGACTCTCTGATACTGCAAATTACACCTGTGTTGCCAA 


o /y 


Qy 


717 


GAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGG 

1 1 1 II I 1 1 1 1 M 1 IN 1 1 M 1 1 1 II 1 M 1 1 II 1 1 1 1 1 1 

AAACATTGTTGCCAAGAGGAAAAGTACAACTGCCACTGTCATAGTCTATGTCAACGGTGG 


lib 


Db 


880 


o o o 

939 


Qy 


111 


GTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAA 

1 11 1 1 1 1 1 1 1 II II 1 II 1 II M II 1 1 1 II 1 II 1 1 1 1 1 1 1 1 M M 
CTGGTCCACCTGGACGGAGTGGTCTGTGTGTAACAGCCGCTGTGGACGAGGGTATCAGAA 


o o ^ 


Db 


940 


999 


Qy 


837 


ACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCA 

MM III 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 M 

ACGTACAAGGACTTGTACCAACCCGGCACCACTCAATGGGGGTGCCTTCTGTGAAGGGCA 


896 


Db 


1000 


1059 


Qy 


897 


GAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTG 

1 1 1 II 1 1 1 1 1 1 1 1 M II 1 III 1 llllllll II Mill MM II II 
GAGTGTGCAGAAAATAGCCTGTACTACGTTATGCCCAGTGGATGGCAGGTGGACGCCATG 


956 


Db 


1060 


1119 


Qy 


957 


GAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGA 

1 1 1 II II M MM 1 1 1 1 1 1 II M M 1 M 1 II 1 II II II 1 1 

GAGCAAGTGGTCTACTTGTGGAACTGAGTGCACCCACTGGCGCAGGAGGGAGTGCACGGC 


1016 


Db 


1120 


1179 


Qy 


1017 


CCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTG 

llllllll II 1 II 1 1 M M 1 1 1 1 1 1 1 Mill II II 11 1 

GCCAGCCCCCAAGAATGGAGGCAAGGACTGCGACGGCCTCGTCTTGCAATCCAAGAACTG 


1076 


Db 


1180 


1239 


Qy 


1077 


TACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGG 

II II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II II M II M II II 1 1 

CACTGATGGGCTTTGCATGCAGACTGCTCCTGATTCAGATGATGTTGCTCTCTATGTTGG 


1136 


Db 


1240 


1299 


Qy 


1137 


— CCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTA 


1193 


Db 


1300 


1 M M II 1 1 M II 1 II II 1 1 MM II Mill 

GATTGTGATAGCAGTGATCGTTTGCCTGGCGATCTCTGTAGTTGTGGCCTTGTTTGTGTA 


1359 


Qy 


1194 


TTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGG 

II III 1 1 1 11 1 1 1 1 M 1 1 1 1 1 M M MM II 

TCGGAAGAATCATCGTGACTTTGAGTCAGATATTATTGACTCTTCGGCACTCAATGGGGG 


1253 


Db 


1360 


1419 


Qy 


1254 


CTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCA 

11111:11 II MMIMI MM 1 1 MMIII 1 II 

CTTTCARCCTGTGAACATCAAG GCAGCAAGACAAGATCTGCTGGCTGTACC 


1313 


Db 


1420 


1470 


Qy 


1314 


GCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATG- 

1 1 1 1 1 1 1 M 1 1 1 II 1 Ml II II 1 II 1 M 1 1 M 1 I 

CCCAGACCTCACGTCAGCTGCAGCCATGTACAGAGGACCTGTCTATGCCCTGCATGACGT 


1372 


Db 


1471 


1530 


Qy 


1373 


GGCCCAGCCCCAAGTTCCAGCTCACCAAT GGGCACCTGCTCAGCCCCCTGGG 

II 1 II II 1 M II II II 1 II 1 1 MMIMI 

CTCAGACAAAATCCCAATGACCAACTCTCCAATTCTGGATCCACTGCCCAACCTGAAAAT 


1424 


Db 


1531 


1590 


Qy 


14ZO 


TrrrrrrrrrrAr AT AfTGrArrACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTC 

1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 II 
CAAAGTGTACAACACCTCAGGTGCTGTCTCCCCCCAAGATGACCTCTCTGAGTTTACGTC 


1484 


Db 


1591 


1650 


Qy 


1485 


1 CCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGA 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 llllllll 


1540 



Db 


1651 


CAAGCTGTCCCCTCAGATGACCCAGTCGTTGTTGGAGAATGAAGCCCTCAGCCTGAAGAA 


1710 


Qy 


1541 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
CCAGAGTCTAGCAAGGCAGACTGATCCATCCTGTACCGCATTTGGCAGCTTCAACTCGCT 


1 c ^ o 


Db 


1711 


1770 


Qy 


1563 


CGGGGGCCGGCTGATGATCCCT7\ATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCAT 

1 1 1 1 1 MM 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

GGGAGGTCACCTTATTGTTCCCAATTCAGGAGTCAGCTTGCTGATTCCCGCTGGGGCCAT 


1622 


Db 


1771 


1830 


Qy 


1623 


ACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGGTT 

M 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 Ml 1 1 1 1 1 1 III Mill 
TCCCCAAGGGAGAGTCTACGAAATGTATGTGACTGTACACAGGAAAGAAACTATGAGGCC 


1682 


Db 


1831 


1890 


Qy 


1683 


GCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGT 

1 1 1 1 1 M II II II II II III II 1 1 1 1 1 1 II II 1 1 M 1 1 M 1 
ACCCATGGATGACTCTCAGACACTTTTGACCCCTGTGGTGAGCTGTGGGCCCCCAGGAGC 


1742 


Db 


1891 


1950 


Qy 


1743 


CCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAG 

1 1 1 M M 1 M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 III 
TCTGCTCACCCGCCCCGTCGTCCTCACTATGCATCACTGCGCAGACCCCAATACCGAGGA 


1802 


Db 


1951 


2010 


Qy 


1803 


CTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCT 

1 1 1 1 1 II 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
CTGGAAAATACTGCTCAAGAACCAGGCAGCACAGGGACAGTGGGAGGATGTGGTGGTGGT 


1862 


Db 


2011 


2070 


Qy 


1863 


GGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGT 

1 1 1 1 II 1 111 II M 1 1 1 1 Ill 1 1 1 1 1 1 M 1 

CGGGGAGGAAAACTTCACCACCCCCTGCTACATTAAGCTGGATGCAGAGGCCTGCCACAT 


1922 


Db 


2071 


2130 


Qy 


1923 


CTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGC 

1 1 1 1 1 1 M 1 1 1 II II M 1 II 1 1 1 1 II 1 II III II 1 1 1 1 
CCTCACAGAGAACCTCAGCACCTACGCCCTGGTAGGACATTCCACCACCAAAGCGGCTGC 


1982 


Db 


2131 


2190 


Qy 


1983 


CAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACAT 

1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
AAAGCGCCTCAAGCTGGCCATCTTTGGGCCCCTGTGCTGCTCCTCGCTGGAGTACAGCAT 


2042 


Db 


2191 


2250 


Qy 


2043 


CCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAA 

1 1 1 1 1 1 1 1 1 II II 1 1 M 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

CCGAGTCTACTGTCTGGATGACACCCAGGATGCCCTGAAGGAAATTTTACATCTTGAGAG 


2102 


Db 


2251 


2310 


Qy 


2103 


GCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCA 

III III llllllll 1 1 II II II 1 Ill 

ACAGACGGGAGGACAGCTCCTAGAAGAACCTAAGGCTCTTCATTTTAAAGGCAGCACCCA 


2162 


Db 


2311 


2370 


Qy 


2163 


CAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGT 

1 II M 1 II 1 II 1 1 1 II M 1 M 1 1 II 1 1 M 1 1 1 1 II 1 1 1 1 1 INI 

CAACCTGCGCCTGTCAATTCACGATATCGCCCATTCCCTCTGGAAGAGCAAATTGCTGGC 


2222 


Db 


2371 


2430 


Qy 


2223 


CAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTG 

1 M 1 1 M 1 1 1 1 1 M 1 1 1 M llllllll III 1 1 1 1 1 1 1 1 M 
TAAATATCAGGAAATTCCATTTTACCATGTTTGGAGTGGATCTCAAAGAAACCTGCACTG 


2282 


Db 


2431 


2490 


Qy 


2283 


CACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGT 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 II 1 M 1 1 M II M II 1 

CACCTTCACTCTGGAAAGATTTAGCCTGT^CACAGTGGAGCTGGTTTGCAAACTCTGTGT 


2342 


Db 


2491 


2550 



Qy 2343 GTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGACAC 2402 

I I I I I I I I I I I I I I I I I I I I I I I I I I IIMII M I I I INI I 

Db 2551 GCGGCAGGTGGAAGGAGAAGGGCAGATCTTCCAGCTCAACTGCACCGTGTCAGAGGAACC 2610 

Qv 2403 AAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCC 2462 

I M III I Mill II II I I I III II 

Db 2611 TACTGGCATCGATTTGCCGCTGCTGGATCCTGCGAACACCATCACCACGGTCACGGGGCC 2670 

Qy 2463 CAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACC 2522 

II II I I I I II I I I I I I II I I I I I I M I I I I 1 I I I I I I I I I IN 

Db 2671 CAGTGCTTTCAGCATCCCTCTCCCTATCCGGCAGAAGCTCTGTAGCAGCCTGGATGCCCC 2730 

Qy 2523 CTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCT 2582 

I I I I II IIMII III lllllll IIIIIMMI I I 

Db 2731 CCAGACGAGAGGCCATGACTGGAGGATGCTGGCCCATAAGCTGAACCTGGACAGGTACTT 2790 

Qy 2583 CAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCG 2642 

I I lllllll MM IIIMII II I I I Mill I II III 

Db 2791 GAATTACTTTGCCACCAAATCCAGCCCAACTGGCGTAATCCTGGATCTTTGGGAAGCACA 2850 

Qy 2643 GCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCC 27 02 

I II I M I I I II I II I I I I I I I I I II M I III I I I I M I 

Db 2851 GAACTTCCCAGATGGAAACCTGAGCATGCTGGCAGCTGTCTTGGAAGAAATGGGAAGACA 2910 

Qy 2703 AGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I I I MM II I II 
Db 2911 TGAAACGGTGGTGTCCTTAGCAGCAGAAGGGCAGTATTAA 2950 



ABT06279; 

24- OCT-2002 (first entry) 
Human NOVla coding sequence. 

Human; NOVX; autoimmune disease; cancer; infection; inflammatory disease; 
storage disorder; muscle disorder; neurodegenerative disorder; nootropic; 
developmental defect; neuroprotective; antiparkinsonian; hypotensive; 
hypertensive; haemostatic; cardiant; antianginal; dermatological ; 
immunosuppressive; antiinflammatory; virucide; antibacterial; anti-HIV; 
antiparasitic; antiallergic; antiasthmatic; antirheumatic; antiarthritic; 
vulnerary; anorectic; antidiabetic; immunomodulator ; antipsoriatic; 
nephrotropic; kerolytic; antiulcer; cerebroprotective; anticonvulsant; 
antiinfertility; antimanic; antidepressant; metabolic; cytostatic; 
tranquilizer; analgesic; gene; ss. 

Homo sapiens. 

WO200257450-A2. 

25- JUL-2002. 



PF 29-NOV-2001; 2001WO-US048922 . 
XX 

PR 29-NOV-2000; 2 000US-0253834P . 

PR 30-NOV-2000; 2000US-0250926P . 

PR 25-JAN-2001; 2001US-0264180P . 

PR 20-AUG-2001; 2001US-0313656P . 

PR 05-OCT-'2001; 2001US-0327456P . 

PR 28-NOV-2001; 2001US-00327456 . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Edinger Macdougall JR, Millet I, Ellerman Stone DJ; 

PI Gerlach V, Grosse WM, Alsobrook JP, Lepley DM, Rieger D, Burgess CE; 

PI Gasman SJ, Spytek KA, Boldog FL, Li L, Padigaru M, Mishra V; 

PI Patturajan M, Shenoy S, Rastelli L, Tchernev VT, Vernet CAM; 

PI Zerhusen BD, Malyankar UM, Guo X, Miller CE, Gangolli EA; 

XX 

DR WPI; 2002-590741/63. 

DR P-PSDB; AA018734. 
XX 

PT Novel isolated polypeptide, designated NOVX, useful for treating or 

PT preventing in NOVX-associated disorders e.g. cardiomyopathy, 

PT atherosclerosis, diabetes, cancer, allergy, asthma, Crohn's disease. 

XX 

PS Claim 9; Page 11; 353pp; English. 
XX 

CC The present invention provides the protein and coding sequences of 

CC several novel human proteins, designated NOVX. These can be used in the 

CC treatment of, amongst others, cancers, autoimmune diseases, infections, 

CC inflammatory diseases, storage disorders, muscle disorders, 

CC neurodegenerative diseases and developmental defects. The present 

CC sequence is a coding sequence of the invention 

XX 

SQ Sequence 2860 BP; 555 A; 950 C; 860 G; 495 T; 0 U; 0 Other; 

Query Match 34.0%; Score 936.2; DB 6; Length 2 8 60; 

Best Local Similarity 61.7%; Pred. No. 2.2e-171; 

Matches 1662; Conservative 0; Mismatches 938; Indels 93; Gaps 7; 

Ov 143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 202 

III III II I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

168 ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 227 

Qy 203 TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 262 

I I I M I I M I I I I M I M I I I III I I I M I I I M I I I M II I I 

228 CCTACATTGTGAAGAACAAGCCTGTGGAGCTCCGCTGCCGCGCCTTCCCCGCCACACAGA 287 

Qy 263 TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 322 

111 II I I I M II II II M II M I I M I II I I I I II I I I I I II II 
Db 288 TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 347 

Qy 323 CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 382 

II I I II I I I I I I I II I I I II I II I I I I t I I I II I 

Db 348 TGGATGAGGCCACCGGTCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 4 07 

Qy 383 TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 442 

I Ml II I II lllllllllll llllllllllllllllllll IMIIIIII III 



408 TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCGG 467 
443 GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 502 

M I I I I I I I M I I I I MINI I I I I I I I III I I M I I I I I I I I I M I I 

468 GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 527 

503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 562 

I I I I I M I I M I I I II I I I I I II I II I I I I I I I I I I I 1 I I I I I I I 
52 8 AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 587 

563 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 622 

I I I I I M I I II I I I I M II I II I I II II I 11 I I II I I I I I I I I 
588 CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 647 

623 CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 682 

I I I II II I I II I I I II I I I I M I I I I I I I I II I I I I I I I 

64 8 CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 707 

683 TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 742 

I I II II I I II I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I III 

708 TGTCGGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCA 767 

743 CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

M I II I I M I I I I I I I I I I I I I I II II Mill I III I I I I I I I I I 

7 68 CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 827 

803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 862 

INI I I lllllll I I I II 

828 CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 887 

863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

I II I I I I I I I I II II Mill Mill Ml lllllll II I II M I III 

8 8 8 CT CCACT CAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 947 

923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 

I I I I I I I I I II I I II I I II I I I II I I I II I I I II II i M I I 

948 CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 1007 

983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

I II I I I II I I I II M II I II II II I II I II M I I M I II M I 

1008 AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1067 

1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

I III II M II III I I I ' ,,,, 

1068 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAACTGG 1127 

1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGG 1162 

I I I I I I I I I I I I M I Ml I I I I 

1128 AGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGG 1187 

1163 TCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAG 1222 

III II II I I II I I II I I I I I I I M M I 

1188 CAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAG 1247 

1223 ATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

I I I M M M I I I MM M I M I I II I I II I I I M I I 

1248 ACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGG 1307 



1280 GCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGCACCACCA 1333 

I I Mill II II II I I II II INN I Mi I 

1308 CAAGGCCCAGTAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCG 1367 

1334 CCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA 1370 

II II II I II I I I I I III I I I I I I 

1368 CCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCATGA 1427 

1371 TGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 

M I I I I I I M I 1 M I I II 
142 8 CCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCA 1487 

1400 — ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACAC ACTGCACC 1447 

I II I I I I I II II I I I I I I I I III 

1488 CCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTG 1547 

1448 ACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACT 1507 

I I I I I I I I I I I I I I I I I I I I I I II 

154 8 GCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCC 1607 

1508 TCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCT 1552 

II I I I I I I I I I I I I M I I II I I I I I I I I I 

1608 TCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCT 1667 

1553 TCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCC 1612 

I M III II II Mill IIIM Mill mil I M I III 

1668 TTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCA 1727 
1613 CAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAG 1672 

I Mill MM II III MM I IN I M I 

1728 ATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAA 17 87 
1673 ACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGAC 1732 

I M II M II I II I II I I I M II I I M II I M 

1788 GTACCCTGCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGAC 1847 
1733 CCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCA 1792 

II I I M II II I M II I M M I M II I I I II II I I II II II II 

1848 CCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCA 1907 
1793 GCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATG 1852 

I I M I M I I I Mini I I I I Mini II M I II M I 

1908 GTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGG 1967 

1853 TGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTG 1912 

II II II II I I II I I I I I I I II I I II M I II I I I II I I M M I 
1968 TGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGG 2027 

1913 CCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCG 1972 

MM II II I M II II I M I I M I I I II I I I I I I 

2 028 CCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCC 2087 

1973 TGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCG 2032 

I M I II II I I II I II II I I II I I II I II I M I II II II I 

2088 GCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGG 2147 



Qv 2033 AGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGC 2092 

I I Mill Mill Ml I I II I Mill I II I III Ml 11 

Db 2148 AGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGG 2207 

Qv 2093 AGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGG 2152 

MIMIM I Mill III I III MM III II III II II 

Db 2208 AGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGG 2267 

Qv 2153 ACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 2212 

MMMIMMMMMM MM I III III II I 

Db 2268 ACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCA 2327 

Qv 2213 AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGT 2272 

II I I II I Mill M I M II I M II II II I II I 11 I 

Db 2328 AGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGG 2387 

Qv 2273 ACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCA 2332 

I I II II I II I II I II II II I II I I 11 I III MM MUM 

Db 2388 CCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCA 2447 

Qv 2333 AGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCA 2392 

II I II M I M I I M I M M I I II I I 11 III I I M I 

Db 2448 AGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGG 2507 

Ov 2393 CCAAG— GACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAG 2449 

1 M III MM I II Ml I 

Db 2508 CAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCA 2567 

Qv 2450 CCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCA 2509 

Ml I MM II IMMMI M I Mi II IMMMM II 

Db 2568 CCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACA 2627 

Qv 2510 GCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACC 2569 

I M I I I I I I I Mill M I II II I II I II 11 11 I II I 

Db 2628 GCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTA 2687 

Ov 2570 TGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACC 2629 

Ml II 11 II M M I II II II I I I M II I M III 

Db 2688 TGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACC 2747 



2630 TGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTG 

I M II I 11 I II I I I I 1 I I 1 1 1 1 1 Mill II III I 

2748 TCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGG 

Ov 2690 GACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

Mill II M I I I 11 I II I I Ml I 

Db 2808 AGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2860 



Qy 

Db 



2689 
2807 



RESULT 13 

ABT062B0 

ID ABT06280 standard; cDNA; 2860 BP. 
XX 

AC ABT06280; 
XX 

DT 24-OCT-2002 (first entry) 
XX 



KW 
KW 
KW 



DE Human NOVlb coding sequence. 
XX 

Human; NOVX; autoimmune disease; cancer; infection; inf larmatory disease; 
storage disorder; muscle disorder; neurodegenerative disorder; nootropic; 
developmental defect; neuroprotective; antiparkinsonian; hypotensive; 

KW hypertensive; haemostatic; cardiant; antianginal; dermatological; ^ 

KW immunosuppressive; antiinflammatory; virucide; antibacterial; anti HIV; 

KW antiparasitic; antiallergic; antiasthmatic; antirheumatic; antiarthritic; 

KW vulnerary; anorectic; antidiabetic; immunomodulator ; antipsoriatic; 

KW nephrotropic; kerolytic; antiulcer; cerebroprotective; anticonvulsant; 

KW antiinfertility; antimanic; antidepressant; metabolic; cytostatic; 

KW tranquilizer; analgesic; gene; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO200257450-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 29-NOV-2001; 2001WO-US04 8922 . 
XX 

PR 29-NOV-2000; 2000US-0253834P . 

PR 30-NOV-2000; 2000US-0250926P . 

PR 25-JAN-2001; 2001US-0264180P . 

PR 20-AUG-2001; 2001US-0313656P . 

PR 05-OCT-2001; 2001US-0327456P . 

PR 28-NOV-2001; 2001US-00327456 . 
XX 

PA (CURA-) CURAGEN CORP. 

XX 

PI Edinger S, Macdougall JR, Millet I, Ellerman K, Stone DJ; 

PI Gerlach V, Grosse WM, Alsobrook JP, Lepley DM, Rieger D, Burgess CE; 

PI Gasman SJ, Spytek KA, Boldog FL, Li L, Padigaru M, Mishra V; 

PI Patturajan M, Shenoy S, Rastelli L, Tchernev VT, Vernet CAM; 

PI Zerhusen BD, Malyankar UM, Quo X, Miller CE, Gangolli EA; 

XX 

DR WPI; 2002-590741/63. 

DR P-PSDB; AA018735. 
XX 

FT Novel isolated polypeptide, designated NOVX, useful for treating or 

PT preventing in NOVX-associated disorders e.g. cardiomyopathy, 

atherosclerosis, diabetes, cancer, allergy, asthma, Crohn's disease. 



CC 
XX 
SQ 



Claim 9; Page 14; 353pp; English. 



PT 
XX 
PS 
XX 

CC The present invention provides the protein and coding sequences of 
CC several novel human proteins, designated NOVX. These can be used in the 
CC treatment of, amongst others, cancers, autoimmune diseases, infections, 
CC inflammatory diseases, storage disorders, muscle disorders, 
CC neurodegenerative diseases and developmental defects. The present 
sequence is a coding sequence of the invention 



Sequence 2860 BP; 557 A; 950 C; 858 G; 495 T; 0 U; 0 Other; 

Query Match 34.0%; Score 936.2; DB 6; Length 2860; 

Best Local Similarity 61.7%; Pred. No. 2.2e-171; 

Matches 1662; Conservative 0; Mismatches 938; Indels 93; Gaps 7; 



202 
227 
262 
287 



143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 

III III M I I I I I I I I I I I I I 1 I I M I I I I I I > N I I 

1 6 8 ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 
203 TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 

Mill II IIIIIIIIMI III Mil III III I Mllllll MM 

228 CCTACATTGTGAAGAACAAGCCTGTGGAGCTTCGCTGCCGCGCCTTCCCCGCCACACAGA 
2 63 TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 322 

Ml IIIIMIIIIIIIII MIIMM Mill Mllllll I M II 

288 TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 347 

323 CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 382 

II I I II II II II II I I I I II MM 

348 TGGATGAGGCCACCGGCCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 407 

383 TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 442 

I M I MMIIMMI MMIMMMIMMMM MIIIIIM I I 

408 TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCAG 
443 GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 

MMMIMMMM IIIMI MMMI III Illllllllll I 

4 68 GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 

503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 562 

I M I M I I I II I II I I II I I I I II I I I I I I I t I 

52 8 AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 587 

5 63 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 622 



467 

502 
527 



708 TGTC 
743 



GGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCA 7 67 



CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

Mill I II I I I II I II 1 II I 11 I II II I I M I I I II I II II I I M 
768 CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 827 

803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 862 

MM I I MMMI II MIIIIMIII II I Mill MM M I 

828 CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 887 
863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

I II IIIIIMI II II Mill Mill III IIIMM M IIIMI III 

888 CTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 947 
923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 

M I M II I I II II I I MM I I 11 I II II II II I I 11 I M I I 

948 CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 1007 



ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 



983 ..-^ 

111 II II I II M I I I I M I I I I I I I II 

1008 AGTGTGCCCACT 



GGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1067 



1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 



1068 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAACTGG 1127 
1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGG 1162 

I I I I I I I I I I I I II I III MM 

1128 AGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGG 1187 
1163 TCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAG 1222 



1188 CAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAG 1247 

1223 ATGTGGCTGACTCGTCCATT— CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

I I Mlllll II I II II II Mill IMIIII I I III I 
1248 ACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGG 1307 

1280 GCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGCACCACCA 1333 

, I Mill II II M ilMI I Ml I 

1308 CAAGGCCCAGTAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCG 1367 



1334 CCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA 13''0 

II Ml I 1 I" "IL 

1368 



CCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCATGA 1427 



3^37]^ TGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 

IMIIII I I I II II I I II 
1428 CCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCA 1487 

1400 — ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACAC ACTGCACC 1447 

I II II I I I I I II I I II I I I I III 

1488 CCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTG 1547 

1448 ACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACT 1507 

I 1 I I II II I IMIIII I ' ' 

1548 GCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCC 1607 

1508 TCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCT 1552 

111 II I I II II I II I I I I I I I I 11 I M 

1608 



TCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCT 1667 



1553 TCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCC 1612 



1 II 111 II II Mill Mill Mill Mill I II 1 III 

1668 TTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCA 1727 

1613 CAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAG 1672 

I II II I II I I I 1 1 II I I 1! I II M I I 11 I M II I 1 11 I 

-^'j2Q ^^■>,T,m^^^^^^/-,-/-r-7^7^^-?>P^T.^"^ar•r:af::nTr;TaT^.TAr:TCATCAACAAGGCAGAAA 1787 



1788 
1733 



I II II I II 11 I I 1 M I M Ill I • ■ 

ATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAA 

ACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGAC 1732 

I I Mill I MM Ml II 1 Mill Ml 

GTACCCTGCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGAC 1847 
CCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCA 1792 



I I I I M I I I I I I I M I M I I I I I I I I I I I I M I I I I I I II II 

1848 CCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCA 1907 
1793 GCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATG 1852 

I I Mill I M Mill I III I I MMI NUN III I 

1908 GTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGG 1967 
1853 TGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTG 1912 

MM Mill MINI I I I I II IIIMIIMIIIllll nil I 

1968 TGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGG 2027 

1913 CCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCG 1972 

I II I II I 1 I II I I I I I II II II I I I I I I I I I I I 

2028 CCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCC 2 087 

1973 TGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCG 2032 

I M I I I I II I I I I I I 1 I I I I I I I I I I II I I I I I I M II I 

2088 GCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGG 2147 

2033 AGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGC 2092 

M II II I I I I I I I I I I M I I I II I I Mill I I MMI I II M I II I II 
2148 AGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGG 2207 

2093 AGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGG 2152 

M I M I M I I I I I I II I I Ml I I I I I M II I I M I M 

2208 AGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGG 2267 

2153 ACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 2212 

II II I I I I I I II II I II I II I I I M I II I I I MM I II II I I II I I 
2268 ACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCA 2327 

2213 AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGT 2272 

MM MMMIMMIMMI MIIIIM MM Mill 111 I 

2328 AGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGG 2387 
2273 ACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCA 2332 

I I I M I II I I M I II II I II II I MM III MM I II I M 

2388 CCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCA 2447 
2333 AGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCA 2392 

M I M I I I M II II M I II I I I I I M I Ml I I II I ^^^^ 

2448 AGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGG 2507 

2393 CCAAG— GACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAG 2449 

I M III M II I M 11 I I 

2508 CAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCA 2567 

2450 CCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCA 2509 

M I I I M I M I M I II 11 I I I I M I I II I II M II M I I I M 

2568 CCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACA 2627 
2510 GCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACC 2569 

M M II I I I I Mill I I II I I I II 1 I II 11 I II I M 

2628 GCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTA 2687 

2570 TGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACC 2629 
M M I I MM I II M II I I II I M II II II II M I I I M III 



Db 2688 TGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACC 2747 

Ov 2630 TGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTG 2689 

I i M I I I I I I I I MM M I I I I I I I I I 1 M III I 

Db 2748 TCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGG 2807 

GACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 



Ov 2690 GACTGGGCCAGCCAGACGCTGCiU(JTUTT«^/^^^^i ^± ^.^v^^^v^v. x ^^^^ 

Mill II II I L.„.i.i!.i„i!iiii 

Db 2808 



AGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2860 



RESULT 14 
ABQ93898 

ID ABQ93898 standard; DNA; 2895 



BP. 



XX 
AC 
XX 

DT 

XX 

DE 

XX 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

KW 

XX 
OS 

XX 

FH 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

XX 

PN 

XX 

PD 



UNC5H2-like NOVll DNA, SEQ ID NO: 37. 



ABQ93898; 

Ol-NOV-2002 (first entry) 
Human transmeinbrane receptor 

Human; NOVX; neurological disorder; Alzheimer's disease; 
Huntington^s disease; Parkinson's disease; pain; behavioural disorder; 
addiction; tuberous sclerosis; cancer; immune disorder; allergy; 
autoimmune disease; myasthenia gravis; asthma; arthritis; ^^^^^^^^^^ . 
thyroiditis; cardiovascular disease; hypertension; reproductive disorder, 
endometriosis; incontinence; psoriasis; scleroderma; alopecia; ulcer; 
pancreatitis; cirrhosis; glomerular endotheliosis ; bacterial infection; 
polycystic kidney disease; endocrine disorder; obesity; cardiomyopathy; 
atherosclerosis; cell signal processing-related disorder; 
metabolic pathway regulation disorder; cytostatic; neuroprotective; 
antiinflarmnatory; irmuunosuppressive ; analgesic; antiatherosclerotic; _ 
dermatological; antibacterial; antiarthritic; hepatotropic; neurogenesis, 
differentiation; proliferation; motility; haematopoiesis ; wound healing, 
angiogenesis; forensic biology; transgenic animal; drug screening; 
gene therapy; NOVll; transmembrane receptor UNC5H2-like; chromosome 10; 
gene; ds . 



Homo sapiens. 

Key 
5'UTR 

CDS 



sig_peptide 
mat_peptide 

3*UTR 

WO200253742-A2, 
ll-JUL-2002. 



Location/Qualifiers 
1. .30 
/■^tag= a 
31. .2868 
/*tag= b 
/product= "NOVll" 
31. .108 
/*tag- c 
109. .2865 
/*tag= d 

/product= "Mature NOVll protein" 
2869. .2895 
/*tag= e 



XX 

PF 07-JAN-2002; 2002WO-US000375 . 

XX 

PR 05-JAN-2001; 2001US-0260018P . 

PR 08-JAN-2001; 2001US-0260360P . 

PR 28-FEB-2001; 2001US-0272411P . 

PR 02-MAR-2001; 2001US-0272817P . 

PR 05-JUL-2001; 2001US-0303231P. 

PR 12-JUL-2001; 2001US-0305060P. 

PR lO-SEP-2001; 2001US-0318405P. 

PR 12-SEP-2001; 2001US-0318700P . 

PR 04-JAN-2002; 2002US-00037417 . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

DT v^VudP, R Alsobrook JP, Tchernev VT, Liu X, Spytek KA; 

P Kttfrajln r Grosse W^l, LepleyDM, Burgess CE Vernet CAM, L. L; 

PI Gorman L, Edinger S, Sciore P, Ellerman K, Malyankar U; 

PI Rothenbe^g M, Stone D, Boldog F, Guc X, Shenoy S, Anderson D; 

PI Padigaru M, Taupier RJ, Miller CE, Eisen A; 



CC 
CC 
CC 



Novel polypeptides and nucleic acids homologous to transmembrane 
recLtor,^h?mosin, neuromodulin-like family of protexns for dxagnosmg, 
treating canSr, atherosclerosis, neurological, skin and auto.ir^une 



XX 

DR WPI; 2002-583619/62 
DR P-PSDB; ABB09520. 
XX 
PT 
PT 
PT 

PT disorders. 
XX 

PS Claim 9a; Page 121; 323pp; English. 
XX 
CC 

CC 
CC 
CC 
CC 
CC 
CC 
CC 



The invention relates to 24 novel human proteins designated N0V1-N0V14 
(ABB09501-ABB09524), collectively referred to as NOVX proteins, and 
nucleic acids encoding them (ABQ93879-ABQ93902) . NOVX proteins and 
nucleotides are useful in the treatment, diagnosis or P^^^^^^ °f J^!^' 
associated disorders or in the manufacture of a medicament treating 
such disorders, with specific applications described for °f ^he 24 

NOVX proteins, based on their homology to known proteins. Various 
disorders are associated with NOVX proteins including neurological 
CC disorders (e.g., Alzheimer's, Huntington's and Parkinson s diseases), 
CC pain behavioural disorders, addiction, tuberous sclerosis, cancers 
CC ^e":, colorectal cancer, leukaemia and osteosarcoma), immune d^-^ders 
CC e q allergies and autoimmune diseases), myasthenia gravis, asthma, 
CC !:;i;;s forms of arthritis, diabetes, thyroiditis, cardiovascular disease 
CC (e.g., hypertension), reproductive disorders, endometriosis, 
CC incontinence, psoriasis, scleroderma, alopecia, ulcers, pancreatitis, 

cJrSosi" glomerular endotheliosis , polycystic kidney disease, endocrine 
disorders obesity, bacterial infections and particularly cardiomyopathy, 
aiherosci;rosis, cell signal processing-related disorders and disorders 
CC ff metaboUc pathway regulation. NOVX nucleic acids and polypeptides may 
CC be used to identify cellular receptors or downstream effectors which 
CC binds to a NOVX protein, and are also useful as targets for the 
CC ideSi^Lation ol small molecules that modulate or inhibit processes such 
CC as neurogenesis, cell differentiation, cell motility, cellular 
CC proliferation, haematopoiesis , wound healing and angiogenesis . NOVX 
CC nucleic acid Sequences can be used to identify a cell or tissue type and 
CC Tre useful as a source of primers or probes for forensic biology and for 



XX 
SQ 



CC identifying and cloning NOVX homologues in other cell types. Cells 

CC comprising NOVX nucleic acids are useful for producing non-human 

CC transgenic animals which are useful for studying the function and 

CC activity of NOVX proteins and for identifying and evaluating modulators 

CC of NOVX activity. The present sequence represents DNA encoding the 

CC transmembrane receptor UNC5H2-like protein NOVll. The gene encoding NOVll 

CC is located on chromosome 10 

Sequence 2895 BP; 557 A; 960 C; 854 G; 524 T; 0 U; 0 Other; 

Query Match 33.2%; Score 913.6; DB6; Length 2895; 

Best Local Similarity 61.6%; Pred. No. 5.1e-167; 

Matches 1664; Conservative 0; Mismatches 919; Indels 129; Gaps 9, 

Ov 143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 

III 111 II I I 1 I I I M I 1 I I I 1 I 1 I 1 1 I I I I 

Db 140 ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 

nv 203 TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 

Mill II IMIIIIIIII III III Ml II Mill MM 

Db 200 CCTACATTGTGAAGAACAAGCCTGTGGAGCTCCGCTGCCGCGCCTTCCCCGCCACACAGA 

Ov 2 63 TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 

II I I I I I I I I I M II I I I II I I I I M I I I I I M 11 I M I 1 II M 

Db 260 TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 

Ov 323 CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 

1 I I 1 II I I I 1 II I I I II I 11 Mill I II II I I II 

Db 320 TGGATGAGGCCACCGGTCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 

ov 383 TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 

I I II IIIIIIIIMI IIMIIIIMIIMIIIIII IMMIIM III 

380 TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCGG 439 



202 
199 
262 
259 
322 
319 
382 
379 
442 



Db 

Qy 443 



502 



Db 

Qy 503 



GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 

I I I I I I I I I II I M I I Ml II MMIII Ml I MM Mill II Mill i 

440 GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTGTCTGCGCAAGAACTTCGATC 499 



AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 

I I I I M I I I I I I I I I I I MM M I I Mill Mill Ml 

Db 500 AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 

Ov 563 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 

nil 111 MM MM II Ml I 11 Mill Ml Mill I I Mill 
Db 560 CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 

623 CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 

I I I I 11 M 1 M 1 MM M II 11 II 1 I I 11 II II M I I M 

Db 62 0 CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 

Ov 683 TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 

I I 1 11 II M II I M I II II II 1 II I I 1 M 1 I 11 II I M 1 I 11 1 II I I I 



TGTCGGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCA 

743 



562 
559 
622 
619 
682 
679 
742 
739 



Db 680 

ov 7 4:i CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

II II I 1 I I 11 1 I II I I M M I 1 II I I I I M II I 11 II 11 M M I I 

Db 740 CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 799 



803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 862 

MM I 1 lllllll M IMIIMMII II 1 mil llllllllllll I 

800 CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 859 
863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

I II II INI Mill Mill III lllllll II II MM Ml 

860 CTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 919 

923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 

I I I I I II M II M II I II I II II M I II II I M I I M II I I 

92 0 CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 979 

9 8 3 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

III MlilMIII Mill Mini I II MINI IIIMIII I 

980 AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1039 



1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

I I II MM MUM lllllll II I I I I lllllll 

040 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAAAGTG 1099 

TGGCCCTGAGGACGTGGCCCTCT 1129 



I I I I I II I I 



AGTCACAGTGTGGTCCTCCTGTCCCCGCAGTGCTGGAGGCCTCAGGGGATGCGGCGCTGT 



1159 



1 

1103 CTTC 
I I 

1100 

1130 ATGTGGGCCTC ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCC 1186 

I I I M I M I I I I I I III I I I I III " ' ' ' 

1160 ATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGGCAATCCTCATGGCGGTGGGGGTGG 1219 

1187 TCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT— C 1243 

Mill MUM I lllllll Mil N I I 

1220 TGGTGTACCGCCGCAACTGCCGTGACTTCGACACAGACATCACTGACTCATCTGCTGCCC 1279 

1244 TCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGC 1303 

III II M II 1 M M 1 M I I I I 1 I I I I II M I I M 1 

1280 TGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGGCAAGGCCCAGTAACCCGCAGCTCC 1339 

1304 T CACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCT 1357 

I I I II I I I 1 I I I I III I I I I I I ' ' ' o.o 

1340 TACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCGCCGGCATCTACCGCGGACCCGTGT 1399 

TG 1372 

1358 GTCCCCGGCAGGA 

1400 ATGCCCTGCAGGACTCCACCGACAAAATCCCCATGACCAACTCTCCTCTGCTGGACCCCT 1459 

1373 GGCCCAGCCCCAAGTTCCAGCTCACCA- ATGGGCACCTGCTCAGCCCCCT 1421 

lllllll I I I M II I I II I I I I I I I I I I I I 

14 60 TACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCACCACGGGCTCTGGGCCAGGCCTGG 1519 



1422 GGGTGGCGGCCGCCACAC ACTGCACCACAGCTCTCCCACCTCTGAGGCCG 1471 

lllllll I Ml I I I 1 I I I I III 

1520 CAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTGGCACATACCCTAGCGATTTCGCCC 1579 

1472 AGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTC CC 1516 

III lllllll I lllllll II 

1580 GGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCCTCGGTTCCCAGCAGCTCTTGGGCC 1639 



1517 TGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGA 1576 

I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I Mill 
1640 TGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCTTTGGCTGCCTGGGTGGGAGGCTCA 1699 

1577 TGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGA 1636 

I I I I I I 1 II II II 11 I I M I I I I I I I I I I I M I Mill 

1700 GCATCCCCGGCACAGGTGTCAGCTTGCTGGTGCCCAATGGAGCCATTCCCCAGGGCAAGT 1759 

1637 TCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCT 1696 

I I I I I I M I I I II I I I II I I II I I M M M 

1760 TCTACGAGATGTATCTACTCATCAACAAGGCAGAAAGTACCCTGCCGCTTTCAGAAGGGA 1819 

1697 GTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGC 1756 

Mill I I I II I I I I I I I I II II 11 I I 1 I M M I 1 I I 1 11 1 I 
182 0 CCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGACCCACAGGCCTCCTGCTGTGCCGCC 1879 

1757 CAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCC 1816 

I II I I 1 I II I 1 11 I I I I 1 1 I I M II I I I I I II I 1 I 1 

1880 CCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCAGTGCCCGTGACTGGATCTTTCAGC 1939 

18 17 TCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGC 1876 

I I I I I MM Mill I I 1 I I 1 1 1 1 Mill II 11 I I I I 

1940 TCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGGTGGTGACCCTGGATGAGGAGACCC 1999 

1877 CCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGC 1936 

I I II II II 11 I I I I 1 I 11 1 I M M I 1 M I I I I 1 I II M II 

2000 TGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGGCCTGTCACATCCTGCTGGACCAGC 2059 

1937 TGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGC 1996 

. I ■ I 1 I I I I I I I I I I I 1 I 11 I I I I I I 



I I I I I II I I I I I 1 I 1 1 I 1 I I I I I M I 1 I M I II I 

2060 TGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCCGCTCAGCAGTCAAGCGGCTCCAGC 2119 
1997 TGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTAC7ACATCCGGCT 2056 

2179 



II I M II II I I M II I I II I I M I I I I M I 1 I I II I M M 

TGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCCTCCGGGTCTACTGCC 



2120 

2116 



2057 



TGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGAC 

t I I I I I I I I I I I I I I III 



II I Mill I I I I I 1 I I 1 II I I 1 I I M II 1 I 1 I I 1 I Mill Ml 

218 0 TGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGCGGACTCTGGGCGGAT 2239 
2117 '''' 



AGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTAT 

I III MIIIM II I I I II I II I I M II I M I I II I I II I I I I I 

2240 ACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGGACAGTTACCACAACCTGCGCCTCT 2299 
2177 CCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGA 2236 

1,111,11 I M I 1 I I I I 11 I I 1 II I M I I I I 1 I 11 1 M M 1 I 

230 0 CCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCAAGCTGCTGGCCAAATACCAGGAGA 2359 

99-^l TCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGG 2296 

MIIIM II llllll MM Mill III I I I IIMIIM MIMIIMI 
2360 TCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGGCCCTCCACTGCACTTTCACCCTGG 2419 

2297 AGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGG 2356 

Ml MM III 1 I I 1 I 1 I II I 11 I M I II II I I I II 1 I 1 

2420 AGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCAAGATCTGCGTGCGGCAAGTGGAAG 2479 

2357 GCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAG GACACAAGGTTTGCTG 2413 



Db 


2480 




2414 


Db 


2540 




2474 


Db 


2600 


vY 


2534 


Db 


2660 


yy 


2594 


Db 


2720 


Qy 


2654 


Db 


2780 


Qy 


2714 


Db 


2840 


RESULT 


15 



II II I II 11 1 I II „ I I 



I II I II I I I I I I I I M I I I I I I I I I I II 

ACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCACCCAGCTGGGACCTTATGCCTTCA 2599 

AGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGG 2533 

I I II I I 1 I I I I M I I I I M I I I I M I I I I I I I I I I I I I 

AGATCCCACTGTCCATCCGCCAGAAGATATGCAACAGCCTAGATGCCCCCAACTCACGGG 2659 

GTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTG 2593 

I llllllllll I 11 Mill 111 Mill I I II I I Mill 
GCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTATGGACCGGTACCTGAATTACTTTG 2719 

CCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCA 2653 

I I I I I I I I I I 1 I 11 MM I M I I II I 1 11 I 1 1 I I 

CCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACCTCTGGGAAGCTCTGCAGCAGGACG 2779 
ACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCC 2713 

III II II II I Mill II III I II II I M II 

ATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGG 2839 



111 I I M 1 I I I 



AAS21316 

ID AAS21316 standard; cDNA; 3884 BP. 

XX 

AC AAS21316; 
XX 

DT 24-OCT-2001 (first entry) 
XX 



XX 

DE Human cDNA sequence encoding for PR04326 polypeptide. 

Human secretory and transmembrane; PRO; mammalian; cancer; lung; breast; 
prostate; cervical; tumour necrosis factor-alpha; TNF-alpha; cartilage; 

^.^^^^^A^^. r.^^^r^r^<.^'. fr^f, fsttv acld; skeletal muscle; adipocyte 



ate; cervical; tumour necx-ut^^^^ ^.a.^^^^ c^^i^..^, -^jr — ' - - _ 

ec^; proliferation; glucose; free fatty acid; skeletal muscle; adipocyte; 
A-peptide; factor VIIA; gene therapy; ss. 



XX 
KW 
KW 

KW ear; 
KW 
XX 

OS Homo sapiens . 
XX 

PN WO200140466-A2. 
XX 

PD 07-JUN-2001. 
XX 

PF Ol-DEC-2000; 2000WO-US032678 
XX 

PR Ol-DEC-1999 



PR Ol-DEC-1999 

PR 02-DEC-1999 

PR 02-DEC-1999 

PR 02-DEC-1999 

PR 09-DEC-1999 



99WO-US028301. 
99WO-US028634 . 
99WO-US028551. 
99WO-US028564. 
99WO-US028565. 
99US-0170262P. 



PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

PR 

XX 

PA 

XX 

PI 

PI 
PI 

XX 
DR 
DR 

XX 
PT 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 



16-DEC- 
20-DEC- 
20-DEC- 
30-DEC' 
30-DEC 

05- JAN 

06- JAN 
06-JAN 
11-FEB 
18-FEB- 
18-FEB- 
22-FEB- 
24-FEB- 
24-FEB- 

01- MAR- 

02- MAR- 

03 - MAR- 
IO -MAR- 
IS -MAR- 

20- MAR- 

21- MAR- 
30-M7VR- 
17-MAY- 

22- MAY- 
30-MAY- 
02-JUN- 
05-JUN- 
2 8- JUL- 
11-AUG- 

23- AUG- 
2 4 -AUG- 
08 -NOV 
10 -NOV 



1999, 
1999 
1999 
1999 
1999 
2000 
■2000 
•2000 
■2000 
■2000 
-2000 
-2000 
-2000 
-2000 
'2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
-2000 
2000 
2000 



99WO- 
99WO 
99WO 
99WO- 
99WO- 
2000WO- 
2000WO- 
2000WO' 
2000WO' 
2000WO- 
2 000WO- 
200 0WO- 
2000WO- 
2000WO- 
2000WO- 
2000WO- 
2000US- 
2000WO- 
2000WO- 
2000WO- 
2 000WO 
2000WO 
2000WO 
2 000WO 
2000WO 
2000WO 
2000US 
2000WO 
2 000WO 
2000WO 
2000WO 
2000WO 
2000WO 



US030095 . 
US030911. 
US030999. 
US031243. 
US031274 . 
US000219. 
US000277. 
US000376. 
US003565. 
-US004341. 
■US004342. 
■US004414. 
■US004914 . 
•US005004. 
-US005601. 
-US005841. 
-0187202P. 
-US006319. 
-US006884. 
-US007377. 
-US007532. 
-US008439. 
-US013705. 
-US014042. 
-US014941. 
US015264. 
0209832P. 
US020710. 
-US022031, 
US023522. 
-US023328. 
-US030952. 
-US030873. 



(GETH ) GENENTECH INC. 

Baker KP, Beresini M, Deforge L, Desnoyers L, Filvaroff E, Gao W; 
Gerritsen ME, Goddard A, Godowski PJ, Gurney AL, Sherwood S; 
Smith V, Stewart TA, Tumas D, Watanabe CK, Wood WI, Zhang Z; 

WPI; 2001-408281/43. 
P-PSDB; AAU12244. 

Isolated , secretory and transmembrane PRO polypeptide used to detect 
other PRO polypeptides, link bioactive molecules to cells expressing PRO 
polypeptides, and detect the presence of mammalian tumors e.g. lung, 
breast, prostate, cervical. 

Claim 3; Fig 145; 813pp; English. 

AAS21244-AAS21518 encode for novel human secretory and transmembrane PRO 
polypeptides. The PRO polypeptides are useful to detect other PRO 
polypeptides, to link bioactive molecules to cells expressing PRO 
polypeptides, to modulate biological activities of cells expressing PRO 
polypeptides, and to detect the presence of mammalian lung, colon, 
breast, prostate, rectal, cervical or liver tumours by comparing PRO 
polypeptide expression in a cell sample to that in a control sample. Some 



cc 
cc 
cc 
cc 
cc 



cc 
cc 
cc 
cc 



of the 275 sequences are also useful to stimulate the release of tumour 
necrosis factor-alpha (TNF-alpha) from human blood, the proliferation or 
differentiation of chondrocytes, the proliferation or gene expression in 
pericyte cells, the release of proteoglycans from cartilage, the 
proliferation of inner ear utricular supporting cells or of T- 
CC lymphocytes, the release of a cytokine from peripheral blood monocytes 
CC (PBMCs), or the proliferation of endothelial cells. Some of the PRO 
CC polypeptides may modulate glucose or free fatty acid uptake by skeletal 
muscle cells or by adipocytes; or inhibit binding of A-peptide to factor 
VIIA The PRO polypeptides can be used in assays to identify molecules 
involved in binding interactions. The polynucleotides encoding PRO 
L.^ polypeptides can be used to generate probes, antisense RNA/DNA, 
CC transgenic or knock out animals and can be used in gene therapy 
XX 

SQ Sequence 3884 BP; 767 A; 1278 C; 1162 G; 677 T; 0 U; 0 Other; 

Query Match 32.8%; Score 902.4; DB4; Length 3884; 

Best Local Similarity 61.4%; Pred. No. 7.7e-165; 

Matches 1677; Conservative 0; Mismatches 926; Indels 129; Gaps 9, 
Ov 143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 202 

III 1 11 II I M I I I I M I I I I I I I I I I I I I I 1 I I N I 

Db 507 ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 566 

ov 203 TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 262 

^ I II I I I I I I I I I I I I I I I I I I ill Hi I I 1 I I II I i i I I I I I I 

Db 567 CCTACATTGTGAAGAACAAGCCTGTGGAGCTCCGCTGCCGCGCCTTCCCCGCCACACAGA 

Ov 263 TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 

III 1 1 1 1 1 1 1 1 1 1 1 1 1 II 11 n I I I I 1 I I I I I I 1 I 1 1 1 1 I M II 

Db 627 TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 

ov 323 CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 

I I I I 11 I I I I I 1 I I I 1 1 1 11 i I I M 1 I 1 M i 1 I 1 

TGGATGAGGCCACCGGCCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 

TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 

I III II I II IMIIIIIMI lllilliilMllllIMM Mlllllll I I 

Db 747 TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCAG 

ov 443 GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 

I I I II 1 I 1 I II I 1 I 1 I 1 I I I 1 1 1 I 1 1 I I 1 I 1 1 I 1 1 1 I I M 1 I I 

Db 807 GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 

ov 503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 

I I I M I 1 I I I I I M 1 I I 1 1 1 I I 1 I I 1 I I I I I I INN 1 1 I I I 1 I I 
Db 867 AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 

Ov 563 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 

I Mill I I III Mlllllll II INI 11 N i INN 

Db 927 CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 

nv 623 CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 682 

^ III MM MM 1 INI II MM 111 I 1 N IMIMIIM 

CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 1046 



Db 687 
Qy 383 



626 

322 

686 

382 

746 

442 

806 

502 

866 

562 

926 

622 

986 



Db 987 

Qy 



6 8 3 TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 742 



I I I I I I I I I I M I I I I I I I M I I M I I I I I I I I I I M I I M I I I I I Ml 

1047 TGTCGGACACTGCCAACTATACCTGCGTGGCCAAGA^^^^ 
743 CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

,1111 I I M M M II M M II M M M M M I I I I ' ' ' ' ' ' ' " ' , , 

1107 CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 1166 
803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 862 

MM I I I Ml II I II IMIMIMM II I Mill I Mill II I Ml 1 

1167 CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 1226 
863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

I M 1 II II M I II II MM! Mill Ml I I II II I 11 M M I I II I 

1227 CTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 1286 

923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 

, I I I I I I I I I I , I I I MM I II II I I I I M II I I I I 

1287 icATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 1346 

983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

III II II M II M I I M I I 11 I 11 1 II MINI M II 11 M I 
1347 AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1406 

1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGT- 1101 

I M I MM 11 I I 1 I II I I I I I II I I 1 I I I I 1 I ^^^^ 
1407 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAAAATA 1466 

GCTTCTGGCCCTGAGGACGTGGCCCTCT 1129 

^^^'^ I I I I I II 1 II I 1 1 I 

1467 AGAAAACTCTAAGCGACCCCAACAGCCACCTGCTGGAGGCCTCAGGGGATGCGGCGCTGT 1526 

1130 ATGTGGGCCTC ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCC 1186 

I I I I M II I Mill 111 I I I I III " ' ' ' 
1527 ATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGGCAATCCTCATGGCGGTGGGGGTGG 1586 

1187 TCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT C 1243 

I M I I 1 I I I M 1 1 I M II I 1 I II I I 11 I I I I I 

1587 iGGTGTACCGCCGCAACTGCCGTGACTTCGACACAGACATCACTGACTCATCTGCTGCCC 1646 

1244 TCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGC 1303 

III I I M M I M I II I I I I I M I I I M II 11 I I 11 I 

1647 TGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGGCAAGGCCCAGCAACCCGCAGCTCC 1706 

T T CACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCT 1357 

I I I II II I M I I I I M I II II II I II 11 II 

17 07 TACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCGCCGGCATCTACCGCGGACCCGTGT 17 66 

rpQ 1372 

1358 GTCCCCGGCAGGA 

1767 AiGicCTGiiiGicTCCACCGACAAAATCCCCATGACCAACTCTCCTCTGCTGGACCCCT 1826 

1373 GGCCCAGCCCCAAGTTCCAGCTCACCA ATGGGCACCTGCTCAGCCCCCT 1421 

I I I I M I I II I I 1 I M II I II II I I 11 I 11 

1827 TACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCACCACGGGCTCTGGGCCAGGCCTGG 1886 

1422 GGGTGGCGGCCGCCACAC ACTGCACCACAGCTCTCCCACCTCTGAGGCCG 1471 

I I Ml I I I I II 11 1 11 I I I III 



1887 CAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTGGCACATACCCTAGCGATTTCGCCC 1946 

1472 AGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTC CC 1516 

1947 GGGicACCCAiiT(!:CTGiA(!:CTGCG(!:AGCGCCAGCCTCGGTTCCCAGCAGCTCTTGGGCC 2006 
1517 TGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGA 1576 

,1,11111111 I I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mil 

2007 TGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCTTTGGCTGCCTGGGTGGGAGGCTCA 2066 

1577 TGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGA 1636 
I I I I I I I I i I I I I I I II 1 II II I I I I I 



2067 



1637 



2126 



GCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCAATGGAGCCATTCCCCAGGGCAAGT 

TCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCT 
MM Mill II 11 I Mill I Ml iL„„.iL.iL. 2186 



1696 



2 127 TCTACGAGATGTATCTACTCATCAACAAGGCAGAAAGTACCCTCCCGCTTTCAGAAGGGA 
1697 GTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGC 1756 

Mill I MM Mi II I IMIMMII I Ml IIMMI Mil 

2187 CCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGACCCACAGGCCTCCTGCTGTGCCGCC 2246 

1757 CAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCC 1816 

I I I M I M I 1111 I M I II I I II 111 1 II I II 1 1 I I 

2247 CCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCAGTGCCCGTGACTGGATCTTTCAGC 2306 

1876 



1817 



2307 



1877 



2367 



1937 



TCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGC 

Mill I II I I I M M I I II I M II M I I I Mill M II I I I I 
TCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGGTGGTGACCCTGGATGAGGAGACCC 

CCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGC 

I I M 11 I II I I I 1 I 11 1 1 M M M I I I II I I I I I M M II 
TGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGGCCTGTCACATCCTGCTGGACCAGC 2426 

TGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGC 
I I II I I I M i I II I II I 



2366 
1936 



1996 



2427 



TCGiiACCTACGTGTTCACGGGCGAGTCCTATTCCCGCTCAGCAGTCAAGCGGCTCCAGC 2486 

1997 TGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCC 2056 

II I II M II I II I I M I I II M I I I I II I 1 11 I IIMMI 

2487 iGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCCTCCGGGTCTACTGCC 2546 

2057 TGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGAC 2116 

I I I M M I I 1 I II II 1 M II 11 M M I M II I I I I II I I 1 I M 
2547 TGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGCGGACTCTGGGCGGAT 2606 

2117 AGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACC^^ 2176 



2607 icTiGGkGiGGiGciGAAACCGCTAATGTTcAAGGACAGTTACCACAACCTGCGCCTCT 2666 
2177 CCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGA 2236 

M MM II I Ml Ml MM Ml Mill M I 11 MMllMII 

2667 CCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCAAGCTGCTGGCCAAATACCAGGAGA 2726 
2237 TCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGG 2296 

I I I I I I I M M II II MM 11 I I 1 I 1 I I I 1 M I I M M I M I II I I II 

2727 TCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGGCCCTCCACTGCACTTTCAC^ 2786 



2297 AGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGG 2356 

III I I I I 111 UN I I! I 11 II i I I M I II 11 I I I M 1 

2787 AGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCAAGATCTGCGTGCGGCAAGTGGAAG 2846 

2357 GCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAG— GACACAAGGTTTGCTG 2413 

I I I I It II I II II II I I I I III I 

' ' ' ' ' 'i' -111 ^^-.m-^r^r^i.r^wn'TrLnmi.n.T.aArAC.r.TaCTGGCICCCIGG 2906 



2847 GGGAGGGCCAGATATTCCAGCTGCATACCACTCTGGCAGAGACACCTGCTGGCTCCCTGG 
14 AGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCA 



I I I I I I I I I I I I I I I I I I I 



2473 

2907 icACTiicTGCTCTGCCCCTCGCAGCACTOTiACCAcicAGCTGGGACCTT^^ 2966 
2474 AGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGG 2533 
MM Ml I III II I I 11 II IN iiiiiUL.L™ ..rTrzvrlrj 

2967 " ^rr^^r^-^n.^-nr-nr-ni, 

2534 GTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTG 2593 

M I I I I I I II I I I II 1 I I I I I I I I I I 

3086 



AGATCCCACTGTCCATCCGCCAGAAGATATGCAACAGCCTAGATGCCCCCAACTCACGGG 3026 



I 1 II 1 1 I I I I I 1 1 1 I I I 1 1 M I 1 1 I I I I I I I I '11';; 

3027 GCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTATGGACCGGTACCTGAATTACTTTG 
2594 CCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCA 2653 

I M 1 1 1 1 1 1 I 1 1 1 1 11 1 1 1 ' ' " 

3087 

2654 ACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCC 2713 



CCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACCTCTGGGAAGCTCTGCAGCAGGACG 3146 



I I I I 



M III 1 Mill " ' 



3147 ATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGG 

2714 TCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

I I I II 1 II lllllll I 

3207 TGGCTGTGGCCACCGACGGGGACTGCTGAGCC 3238 



3206 
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RESULT 1 
US-08-808-982-1 

Sequence 1, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

MDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 



ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808 , 982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 3014 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-808-982-1 

Query Match 82.1%; Score 2259; DB 2; Length 3014; 

Best Local Similarity 89.7%; Pred. No. 0; 

; Conservative 0; Mismatches 280; Indels 0; Gaps 0; 

ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

IMIIIIIMMIIIIIIIIMIIMII IMIMIMIIIIMIMIIIIII II Ml 

ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 



Matches 


2427 


Qy 


46 


Db 


1 


Qy 


106 


Db 


61 


Qy 


166 


Db 


121 


Qy 


226 


Db 


181 


Qy 


286 


Db 


241 


Qy 


346 


Db 


301 


Qy 


406 


Db 


361 



CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 

II II I II M II I I 1 M I I I I 1 I I II M IIIIMII IMIMM IIIMMMII 

CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 
GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 

IMIIIII IllMMMIIMl Mill Mill llllllll IIIIMIIIIMII 

GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 
GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 

III II I MIIMMMI MM! Mill MIMIMIIM M Mill 

GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 
TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 

Mill llllllll II Mill II M II I M I II Ml I II II I M MM 

TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 
ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 

IllMMMIIMl 11 II M M III I M MIIMMI 

ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 
GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 

IIIIIIIIIIIIIMM IIMMMMMMIMIM IMIMM IIMIIIMIIi 

GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 



165 

120 

225 

180 

285 

240 

345 

300 

405 

360 

465 

420 



525 



466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 

IIIMIII M Ml IIMIIIIIIIIII IIIMMMII IMIIIII I 

TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 480 



421 
526 



TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 

II IIIIMM illM II II II II II >> 'I " " ""' "' 

481 TCACTGa 



585 



;agcaaggcattgtactaccttgtcgccccccagaaggaatccccccagctgag 54 0 



586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 



541 
646 



lllllllllll II II IIIMIII IIIMIII MMI M MIIMII IIIIM 

GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 



706 
661 
766 



600 
705 
660 
765 

TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 720 



ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 

IIIIIIIIIMIIIIM M IMIMIIMI II IIIMIIMIMIMIII 

601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 



TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 

I II II III MM IIIIM M MIIMII Ml IMI_M_IMmi^_ " " 



GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 

IIIIIMMIIIIIMIIIIIIIIM IIIIIIIIIIIIIMMIIIMIMIIMIII 

721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 



82 6 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 

IIMMMMMMMMMMMMMIMMMMI IIIIIIIMIIIIIIII III 

7 8 1 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 



886 
841 
946 



TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 

IIIIIIIIIIIIIIMIIIMIIMMMIIIMIIII IIMMMMI II M Ml 

TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 



TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 

Mill IIIIM MM IIMIMMM I II M II M I M II II II I II II 

901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 



1006 
961 
1066 
1021 
1126 
1081 
1186 



GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 

IIIIMMMMM IMIIII MMI IIMIIM I Ml IIMMMMI 

GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 
ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 

IIIIM IMIIIMMMI MM III Mill IIIIM MM IMMMMM 

ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 
CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 

Mill I Mill MM INN II HIM MM I IM 

CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 



CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 

III MM 11 M IIIMIII III II MUM IIMIIM IIMMMMI Ml 

1141 CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 



124 6 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 

Mill MM II MM I M MIIIIIIMIIMMIIMIMMIM II I II I 

1201 ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 



825 

780 

885 

840 

945 

900 

1005 

960 

1065 

1020 

1125 

1080 

1185 

1140 

1245 

1200 

1305 

1260 



ACCATCCAGCCAGACCTC 

CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I mil I M M M I I I I M I 1 I I I I 

CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 1380 



1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

1 1 1 1 1 1 1 1 1 1 1 1 ' iliillii!iiii!ililii,LLJJ 

1261 n,^.T,rT,^^ 

1366 
1321 
1426 
1381 
1486 
1441 
1546 



GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I Mllllll II lllllllllllll I Mill Hill IIMIIIM 

AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 



1440 
1545 



CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 

I I I I I I I I M I I M I I I I II I I II I I I 1 I I M I I I I I 1 1 I I I I II M I 

CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 1500 

1605 



GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 

lllllllllllillllllllllllllllllMIIIMIIIIIM II IIIIIIIIMII 

1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 1560 
1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 

II Mill lllMIII Mllllll Mllllll MMIMIIIMII IMMIIM 

ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 



1665 
1620 



1561 

1725 



1666 



CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 
, . , , I I I I I I I I I I I M I M I I I I I M I I II 



II I I 1 1 II M 11 II I M M II 1 I II II II M M II M M I M 1 M M II M II M M 
1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

Mill Mill II MMMMIMMMMMMMIM M IMIIMIIMIII 

1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 1740 
17 8 6 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

M I M II I I M M II I I II I M I II II I II M I II II M M M I I II 

1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 
1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

II II MUM MM I 11 II Mini I II illMIIIIIMIIIIIIIIIIillll 

1801 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 1860 

1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

III 1 II II I I II M 11 I M 11 II M I M I li II 

1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

II I II II M I II M I M II I I 11 II II Ml II II M 1 M M II I I I I M M M 

1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCT 1980 



2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 
I I I I I M I I I I M I II II I I 



2085 



I mTi ^TVl I'lTlTl I I II 1 M II II I II I II I I M II II M I M IMMIIM 

1981 TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 
2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

II I M II M M II II M II I 1 II II II 1 11 1 II II 11 II M I M I II II I II 1 M I 

2041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

2205 



2146 



TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 



Db 


2101 ' 


Ov 


2206 . 


Db 


2161 . 


Ov 


2266 


Db 


2221 




2326 


Db 


2281 


Ov 


2386 


Db 


2341 


Ov 


2446 


Db 


2401 




2506 


Db 


2461 


Ov 

wy 


2566 


Db 


2521 


Ov 

vy 


2626 


Db 


2581 


Qy 


2686 


Db 


2641 


Qy 


2746 


Db 


2701 



,,,11 M I I I I I I I I 1 I I I M I II II IIIIIIMIM lllllllllllll 

TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 
AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 

I I I I I Mill II I II I II I I I II I M M II IN I I I I I I II I I I I I II I 

AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 

CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 

I I II III M I I I I I I I I I 1 I I I I I I I I II I I I III I II I I I II II I I I I 11 
CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 

GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 

I I I I I I I II I I I I I I II I I I I I I M I I I II I M lllllllllllll II I I 

GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 
AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 

IIIIIIM I I I I I I I I I I I I I I I I I I I I M IMIIMIIIIIIMIII lllllll 

AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 
CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 

II II I I 11 I I I I I I I M I I I I I I I M I I I I I I I I I Ml I I I I I II 

CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 
TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 

Mil llllllllllllll II mil IIIIIMM I II IMIII 

GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 

CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 
II I I II I I I I I I M I I I I I I I I II I I I M I I I M I M I I I M I M I I I I I I I 

CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 



2160 
2265 
2220 
2325 
2280 
2385 
2340 
2445 
2400 
2505 
2460 
2565 
2520 
2625 
2580 
2685 



AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 

Mill III I I III I I I I I I I I I M I I I I M I I I I I I I I I II II I III Hill III 

AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 2640 
GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

I I MM II Mill IIIIIMIIIMM IIIIIMMII MMI MM I 

GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGAGAC 2700 



MM 



RESULT 2 

US-09-306-902A-1 

Sequence 1, Application US/09306902A 
Patent No. 6277585 

GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
Leonardo, E. David 
Hink, Lindsay 
Masu, Masayuki 
Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
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STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
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ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 
FILING DATE: 07-May-1999 
CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 3014 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-306-902A-1 

Query Match 82.1%; Score 2259; DB 3; Length 3014; 

Best Local Similarity 89.7%; Pred. No . 0 ; 

Matches 2427; Conservative 0; Mismatches 280; Indels 0, Gaps u. 



Qy 


46 


ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 

IMIIIIIIIIIMIIIIIIMIIIMI IMIIIMIIIIMIIMIMIII Mill 

ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 


105 


Db 


1 


60 


Qy 


106 


CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 

II 11 iiiiiiiiiiMM 1 nil 11 11 UN 

CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 


165 


Db 


61 


120 


Qy 


166 


GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 

MINIM M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 M M 1 1 1 1 1 M 1 

GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 


225 


Db 


121 


180 


Qy 


226 


GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 

Ml II 1 MM Ml Mill Mill Mill MM IMM 

GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 


285 


Db 


181 


240 


Qy 


286 


TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 

II II 1 II M II M II II 1 II II II M II 1 1 1 1 II 1 1 II 1 11 M MM 

TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 


345 


Db 


241 


300 


Qy 


346 


ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 


405 



11 I I I M I I I I I I I II II II M I 11 ii M M N I 11 I I II I I I I I I I I II I I 

301 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 360 
406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 465 

I I I I I II M II I I M M I I II 11 M I! I II M I I I M I I I I I I I I II I I 111 I I M I 

GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 420 



466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 



361 

TACATCCGCATAGCCAGATTGCGCAA 

II Mill I II III Ml II Ml I I III I I I MM II Ml MM Ml 

421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 480 
52 6 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 5 8 5 



Mill 11 II II M II II II II M Mill Mill Ml 

481 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 
586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 

IIIIIIIIIM II II Mllllll IIIIMM IIIM II Mill Ill 

541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 
646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 

MM Mllllll II Mill MIIIMMM II IMMMMMIMMM 

601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 
706 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 

II MMMMMIMMM M IIIIMM Ml MM M M MM! M II 

661 TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 

766 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 

IIIMMMMMMMMMMIll II I I I I I I I M I II I I 1 M M II II I M I 1 M 
721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 

826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 

Mill MMMMMIMMIMMMIM MIIMMMMMIM Ml 

781 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 840 
886 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

I I II I II II M II I 1 II I I 1 I I I I I II 1 I II I I I II I I II I I II I I I II 11 M III 

8 4 1 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 9 0 0 
946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

Mill M II II I II II II I I I I I 1 II I I I II II II M II II M II I M II M M 

901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 960 
1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

I I I I I I I I I I II 1 1 II I I I I I N I M II I I M II II 1 1 I M 

GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 1020 



540 
645 
600 
705 
660 
765 
720 
825 
780 
885 



961 
1066 
1021 
1126 
1081 
1186 



ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

M II I II M II I I II II II II M I I I I MM II II 11 I I I I M M II I II II 

ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 1080 
CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

Mill I Mill I 1 II I I I I I I 1 M I II II MM 

CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 
CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

I II MM II II II M I I I I II I M I I M II M II M II M II M II I I I Ml 



GTCGCAPlGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 1200 



1141 CTCATTTACT 

'CT 

1 1 1 M I IN 1 1 111 M 1 1 M 1 1 1 1 1 1 1 1 1 IN LUnii!i!i!!iiili!.iliii 



124 6 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 



1201 AcCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 12 6 0 
1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

IIIIIIIIIM lllllllllllllllll IIIIIMIIIIMNIMII 111 I II 

1261 ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 



1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 

llllllll MIIIIIIIIIIIIMIIMI I Mill llllll Ill Mill 

1321 CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 

1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 

I IMIIMI M IMIIMIMMI IMIMIIMMII Mill MIIIMM 

1381 AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 
1486 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 

I I I I I I I I I I I I M M I M II I M 11 M M II I II II II I 1 II 1 I 11 I 

1441 CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 
1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 

IIIIIIMIMMIIMIMMMMMIMMIMIMIMM II I Mill I Ml I I 

1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 
1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 

II Mill llllllll llllllll MIIMM IMMMIMMM lllllllll 
1561 ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 

1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 
II MMMMMIIIMIMMMMIMMMIMMMIMMMM MIIMM 
1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 

1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 

lllll Mill II 1 11 I I II 11 I I M I M I II I M I MIIIIIMIIMI 

1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 
1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 

M I M 11 I 1 1 1 M 1 I 1 1 1 I I M 1 I I 1 1 1 M I I 1 M I M II 1 II II II M II M I III 

1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 
1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 

MMMMIIMIMM M llllll I M M 1 1 II II M I II M II M 11 M II I I 
1801 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 



1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 

Ml 1 I 1 I 1 II I Ml I I I II I I 1 M 1 II I I 11 1 1 II 1 I II I M I I II II 1 

1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 

1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 

illMlllllllIll MIMMMIM Ml MIIMIMM II llllllll M 

1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 1980 
CAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 



1320 
1425 
1380 
1485 
1440 
1545 
1500 
1605 
1560 
1665 
1620 
1725 
1680 
1785 
1740 
1845 
1800 
1905 
1860 
1965 
1920 
2025 



2026 TCCCTCGAGTAi 

mil I llllll MM 111 11 M 1 M I M_l l_[|iliiiiilLii„iij,lii] 

1981 " ' m^m^ 



TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 



Ov 2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

IIIIIIIIIIIIIIMIIIMII II IIMlllllllllMIIIII II II Ml II II 

Db 2041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

OV 2146 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 

Mill MIIIMIMIIMIM II 11 III IN II II III II IN 

Db 2101 TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 

Ov 2206 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 

Mill Mill MMMMMIMMMMMM Mill III II III Ml III II 

Db 2161 AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 

Ov 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 

MM III MMMMIM I 1 Ml MM 

Db 2221 CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 

Ov 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 

^ II MM Ml MMMMIM MMMMIM II MM Ml MM II Mill Mill 

Db 2281 GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 

Ov 2386 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 

llllllll IIIIIIIIIIIIIMMIII M I III II III Ml II MM II 111 II 

Db 2341 AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 

Qy 2446 



Db 2401 



CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 

IIMMMMMMMMIMMMMMMMMIMMMMMMM Mill M 

CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 
nv 2506 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 

MM IIIIMIIIMIM II Ml Mill 1 IIIMMMM 

Db 2461 
Qy 2566 



Db 2521 
Qy 2626 
Db 2581 



GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 
CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 

IIIMMMMMMM II M II I M I I I I I I I M M M M I I MMMIMMMM 

CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 
AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 

IIIM llllllll MIIIMIIIM IIIIIM II I Mill Ml 

AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 



Ov 268 6 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 

M MMMMIM Mill IIIIIMIIIIIII I Mill MM I Mill MM I 



2205 
2160 
2265 
2220 
2325 
2280 
2385 
2340 
2445 
2400 
2505 
2460 
2565 
2520 
2625 
2580 
2685 
2640 
2745 



Db 2641 



GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGAGAC 2700 



Qy 2746 CGGCCAG 2752 

1 I I I II 

Db 2701 CAGCCAG 2707 



RESULT 3 
US-08-808-982-2 

Sequence 2, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 



APPLICANT: Hink, Lindsay 
APPLICANT: Masu^ Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1787 base pairs 
TYPE: nucleic acid 
STRANDEDNES S : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-808-982-2 

Query Match 56.8%; Score 1562.4; DB 2; Length 1787; 

Best Local Similarity 98.5%; Pred. No. 0; n p 

Matches 1661; Conservative 0; Mismatches 16; Indels 9; Gaps «; 

Ov 1070 GCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCT 1129 

^ I I I M M I I I I I M I I M I M I I I I I I I I I I I I I I I I I I I I M t I I I I I I I I M 1 I 11 

1 GCAACTGTACCAGTGACCTCTG-GTACACACTGCTTCTGGCCCTGAGGACGTGGCCCTCT 59 

1189 
119 



Db 

Qy 1130 



ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 

IMillMIMMMIIMIIIIMIMIIIMIIIIIIMIIIIIMIillll I 

ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 



Db 60 

Ov 1190 TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 1249 

I I M I I I 1 M I I I I I I I I I I M I I M I M I 1 I I M I I I 1 1 I I M I I I I I Ml M I I I I I ' 



Db 120 
QY 



TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 



179 



1250 CAGGCTTCCAGCCCGTCAGCATC-AAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 1308 

I M I I I I I I I I I I M I I I I I I M M I M I I I I I I M I I I I I I M M M I I M M 

Db 180 CAGG 



3GCTTCCAGCCCGTCAGCATCTAAGCCCAGCA7WVGCAGACAACCCCCATCTGCTCACC 239 



1309 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 1368 



480 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATC 539 
1609 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 1668 

IIMIIMIIIIIIMIIIIIIIIIIIIMIIII I I I I I 1 I I I I 1 I I I I M I I I I 

5 4 0 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 599 
1669 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 1728 



I I I I I I i I I I I I I I I I I I I I I I 1 I I 1 I I lliiiliiilllili 

600 



GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 659 



1729 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 1788 

III IIMIIIIIMIIIIIMIIIIIIMIIIIIIIIMIIMIMIIIIIIIII 

660 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 719 
1789 CCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAG 1848 

I I I M I I I I I I I I I I I N I I I I I I I 1 I I I I N I Ml iiliii liiiiiiii 

720 



CCCAGCCCTGACAGCTGGAGCCTGGCCCTCAAAAAGCAGTCGTGCGA 778 



1849 GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 1908 

Mill iiiiiiiiiiiiii mil mill III! iiiiimmmiiiiii 

77 9 GATGT-CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 837 
1909 AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 1968 

I II I 11 llllim Ml III Ml III Mil I Mi III I I Ml I I I I Ml I I I I I I I I Ml 
838 AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 897 

1969 AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 2028 

1 M II M II II Ml 11 M II M Ml M M II II M I III Ml I Ml III III I M I I III 

AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 957 



898 
2029 

958 
2089 
1018 
2149 



CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 2088 

M iiilMllllllllllllliMlllllllllllllllllllllllllillM 

CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 1017 
GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 2148 

II II I M II I II Ml I I III I I II 11 Ml I I M II II I M I II M I Ml I II I M II M 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTT- 1076 



AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 



2208 



Db 


1077 


yy 


2209 


Db 


1135 


r\\T 
vy 


2269 




1195 




2329 


nh 


1255 


QY 


2389 


Db 


1315 


Qy 


94 4 Q 

Z. 1 1 -7 




1375 


Qy 


^ w ^ 


Db 


1435 


Qy 


^ 3' 




1495 


Qy 


2629 




1555 


Qy 


2689 


Db 


1615 


Qy 


2747 


Db 


1675 



IIIMIIMIMIIIMMI III IIIIIIIIIMIIIIII MINIM 

AAGGACAGTTACCACAACCT— GCCCTATCATCCACGATGTGCCCAGCTCCCTGTGGAAG 
AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 

I I M M II I M I M M II M II M M M II M M M II M M M M M M I M M M I M 

AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 
CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 

IMIIMIIIIMIIMIMIIIIIMIIIMIIMIIIMIMIIIIMIIII 

CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 
TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 

I I I I I I I I I I M I I I I I M II I II I I I M M M II M I I II I II II M II M II 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 
ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 

I M M II II II I M M M II II II II II I I I II II II I II M II II I II I II I I M I I M 

ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 
GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 

I I I I I I I I I I I I I I I M I I II II I I I I M I I I II I II I I M II I I I M II II M 

GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 
AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 

I I I I I I I I I I M M II II II II M II II II I M II I M II I II I II I I M I M II M I I I 

AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 
CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 

III M II II I I I I I M M I I M M I I I II II I I I I I I I I I II II I I I I I II II M 

CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 
CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 

I I I M M II M M II II II I M II I I II i II I II II I II II II II II II I M II I I M I I 

CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 



II M I I II I I II I M II II M I II M I I II I I I I I I M II I I I II I M M 

GGGACTGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCC 



2268 
1194 
2328 
1254 
2388 
1314 
2448 
1374 
2508 
1434 
2568 
1494 
2628 
1554 
2688 
1614 
2746 
1674 



I I I 



RESULT 4 

US-09-306-902A;^^'' 

; Sequence Application US/09306902A 
; Patent NoY 6277585 \ 
; GENERAL B^FORMATIOnI 

APPLlbi^T: Te/sier-Lavigne, Marc 
; Leonardo, E. David 

Hink^ Lindsay 
; Masu, Masayuki 

Kazuko, Keino-Masu 
; TITLE OF INVENTION: Netrin Receptors 

; NUMBER OF SEQUENCES: 9 

CORRESPONDENCE ADDRESS: 



ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 
FILING DATE: 07-May-1999 
CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1787 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

US-09-306-902A-2 

Query Match 56.8%; Score 1562.4; DB 3; Length 1787; 

Best Local Similarity 98.5%; Pred. No. 0; ^ , u n p- 

Matches 1661; Conservative 0; Mismatches 16; Indels 9; Gaps 

Ov 1070 GCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGdACC^GGCCCTGT 1129 

I I M 1 I I I I 1 I I I I I I M 1 I 1 I I I I I I I I I I I I t 1.11 I liililii'liiiiiiiiiii _ 

Db 

Qy 



1 GCAACTGTACCAGTGACCTCTG-GTACACACTGCTTCTGGCCC^AGGACGTGGCCCTCT 59 
1130 ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTbcTGCTTGTCCTCATCCTCG 1189 

IIIIIIIIMIIIIIIIIMIIIMIIIIIIIIIIMMI^IMIMIMMMIIMM 

Db 60 ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 

nv 1190 TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 

IllllllllllllllMIMIlllllllllllllMIMMIIMIIIINMIIIIIM 

Db 12 0 TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 

ov 1250 CAGGCTTCCAGCCCGTCAGCATC-AAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 1308 

M I I I 1 I I I M I M I 1 I I I I I I I I I M I I I I I I 1 1 I M I M I M I I I I I I I I I I 

CAGGCTTCCAGCCCGTCAGCATCTAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 



Db 180 

Qy 



119 

1249 

179 



239 



1309 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 1368 

I I M 1 I M M 1 I I 1 M I M I I I I I I M I I I I I I I I M I I 1 I I I M I M 1 M M I I I I I I I 

Db 240 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 299 

Qy 1369 GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 1428 



IIIIIMIMIMIIIIMIIMIIIIMIIIIIMIIIIIIIIIIIIIIIIMIIIIII 

300 GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 35 9 
1429 GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 1488 

I I I I 1 I I I I I I I I I I I I 1 I I I M I I I I 1 I I M I I I I M I I M M I I I I I I I I M I I I I I I 

360 GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 4 1 9 
1489 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 1548 



600 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 659 
1729 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 1788 

IIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIINMIIIIIIIIII 

660 GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 719 

1789 CCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAG 1848 

I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 1 I I I I M I I I I I I I I I I I 1 M I I I I I I 
720 CCCAGCCCTGACAGCTGGAGCCTGGCCCTCAAAAAGCAGTCGTGCGAGGG-AGCTGGGAG 7 7 8 

1849 GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 1908 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I N I I I I I I I > I < I I ' 
119 GATGT-CTGCA" 



.CCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 837 



1909 AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 1968 

I I I I I I I I I I I I I I I I I I I I I 1 M I I 1 I I I I I I I I I I I I I I N I N I I I I 

838 AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 897 

1969 AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 2028 

I I I I M I I I I I I I I I I I I I I I I I I I 1 I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I 
8 9 8 AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 

2029 CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 

I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I M I I I I I I I I I I N I I I I I M I I I I I 
9 5 8 CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 



2089 GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 

IIIIIIIIIIIIIMIIMIIIMMIIMIIIIIIIIIMIMM' 

1018 



957 
2088 
1017 
2148 



GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTT- 1076 



2208 



2149 AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 

I I I I I I I I I I I I I I I I I M I IN I I I I I I I I I I I I I I I I I I I M I I I I I M I I I 

1077 AAGGACAGTTACCACAACCT— GCCCTATCATCCACGATGTGCCCAGCTCCCTGTGGAAG 1134 



2209 AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I N I I I I 



2268 



Db 


1135 


AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 


1194 


Qy 


2269 


CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 

MIIIIIIIIIMIIMIIIIMIIIIIIIIMIIMIIMIMIIIIIIIIMMIIM 

CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 


o o o o 


Db 


1195 


1254 


Qy 


2329 


TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 

IIIIIIIIIMMIIIIMMIMIMIIIMIMIIIIIIIIIIIIIIIIIIIMMM 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 


2388 


Db 


1255 


1314 


Qy 


2389 


ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 

IIMIIIIIIIMMIIIMIIIMIII MIIIMIIMMIIIIMIIM 

ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 


2448 


Db 


1315 


1374 


Qy 


2449 


GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 

MM M 1 1 1 1 1 1 M M 1 1 1 1 M 1 1 1 1 1 M 1 M 1 M 1 1 1 1 1 1 M 1 M 1 1 1 M 1 1 1 1 

GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 


2508 


Db 


1375 


1434 


Qy 


2509 


AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 

MMMIIIIIMIIMIMIIIIIMIIIIIIMIIII 1 1 1 1 1 1 1 1 1 1 M M 1 1 

AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 


2568 


Db 


1435 


1494 


Qy 


2569 


CTGGACAGCCATCTCAGCTTCTTTGCCTCC7\AGCCCAGCCCCACAGCCATGATCCTCAAC 

M M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 N 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 


2628 


Db 


1495 


1554 


Qy 


2629 


CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 

M M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 


2688 


Db 


1555 




Qy 


2689 


GGACTGGGCCAGCCAGACGCTGGCCTC-TTCACAGTG-TCGGAGGCTGAGTGCTGAGGCC 

M M 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 

GGGACTGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCC 




Db 


1615 


1674 


Qy 


2747 


GGCCAG 2752 
1 1 1 1 1 1 

GGCCAG 1680 




Db 


1675 





RESULT 5 

US-09-833-381-1806 

; Sequence 1806, Application US/09833381 

; Patent No. 6672186 

; GENERAL INFORMATION: 

; APPLICANT: Robison, Keith E. 

; TITLE OF INVENTION: No. 6672186el Nucleic Acid and Protein Homologs 
; FILE REFERENCE: 5800-119 

; CURRENT APPLICATION NUMBER: US/ 09/833, 381 

; CURRENT FILING DATE: 2001-04-11 

; PRIOR APPLICATION NUMBER: 09/516,448 

; PRIOR FILING DATE: 2000-02-29 

; NUMBER OF SEQ ID NOS : 2050 

; SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 1806 

; LENGTH: 1282 

; TYPE: DNA 

; ORGANISM: Homo sapiens 



FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (1) . . . (1282) 
OTHER INFORMATION: n = A,T,C or G 



US-09- 


833-381- 


1806 




Query Match 42.0%; Score 1155.8; DB 4; Length 1282; 
Best Local Similarity 98.2%; Pred. No. 3e-240; 

Matches 1263; Conservative 0; Mismatches 13; Indels 10; Gaps 


9; 


Qy 


1469 


CCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAAUiAUl iuuvj(^n^L,ui'cTOL.\.^'^ij^iavj^^^ 

1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
CCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCA 


1528 


Db 


1 


60 


Qy 


1529 


CCAGCAACATGACCTATGGGACCTTCAACTTCCTCbb(jb(jUUU^3U i 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

CCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATA 


1588 


Db 


61 


120 


Qy 


1589 


CAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAtjCjCjAACjAi u i ^ i 

MM M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CAGGAATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCT 


1648 


Db 


121 


180 


Qy 


1649 


ACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGUCUO 1 ACjU i kj^aL- i i 

1 M 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M 1 1 1 1 1 1 N M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

ACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGC 


1708 


Db 


181 


240 


Qy 


1709 


TGAGTCCCATCGTTAGCTGTGGACCCCCT-GGCGTCLi GUi UACUU^uUL^ALji i 

M 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 Ml 

TGAGTCCCATCGTTAGCTGTGGACCCCCTGGGCGTCCTGCTCACCCGGCCAGTCATCCTG 


1766 


Db 


241 


300 


Qy 


1767 


GGCTATGGACCACTGTGGGGAGCCCAGCCC i bAUAbUi bbAbu^^ i kj^^^k^ i ^^/^mm/^'^v^ 

M M M 1 1 1 M M M M M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 
GGGTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGGAGCCTGCGCCTCAAAAAGC 


1825 


Db 


301 


360 


Qy 


1826 


AGTCGTGCGAGGGCAGCTGGGAGGATGTGC- i bUAL-Ui bbQjUbAb^jA^jtj'w^^ji^'^'^i '^v^^-rt.^ 

M M 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGTCGTGCGAGGGCAGCTGGGAGGATGTGCTTGCACCTGGGCGAGGAGGCGCCCTCCCAC 


1884 


Db 


361 


420 


Qy 


1885 


CTCTACTACTGCCAGCTGGAGGCCAGTGOC^ 1 bt^ i Abb i b i i L.Abb^-H.'^^/^o^ i 

1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGAGCCGC 


1944 


Db 


421 


480 


Qy 


1945 


TTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 M 1 1 1 1 1 1 1 1 M 1 1 M M 

TATGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTG 


2004 


Db 


481 


540 


Qy 


2005 


TTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGAC 

M 1 1 t M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 

TTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATACTGGTCTACTGCCTGCATGAC 


2064 


Db 


541 


600 


Qy 


2065 


ACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATC 

II M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 M 1 1 

ACTCACGATGCACTCAACGTAGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATC 


2124 


Db 


601 


660 


Qy 


2125 


CAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCAC 

M 1 1 1 M M 1 1 M 1 1 1 1 1 1 1 M M 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 M M 1 1 1 1 1 1 1 1 1 M 1 1 

CAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCAC 


2184 


Db 


661 


720 



Qy 


2185 


GATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTT 
1 1 1 1 1 1 t 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M M M 1 1 

GATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTT 


2244 


Db 


721 


780 


Qy 


2245 


TATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

I 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M M 1 1 1 1 1 

TATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTC 


2304 


Db 


781 


840 


Qy 


2305 


AGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGG 

i « til 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 N 1 1 1 1 1 1 N 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGG 


2364 


Db 


841 


900 


Qy 


2365 


CAGAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CAGAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCT 


2424 


Db 


901 


960 


Qy 


2425 


CTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTC 

i . ■ I ■ ■ ■ i I 1 1 L I 1 1 1 1 t 1 1 1 1 1 1 1 1 1 J 1 1 1 1 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGGAGAGTGAAGCGNGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTC 


2484 


Db 


961 


1020 


Qy 


2485 


CTCATTCGGCAGAAGATAA-TTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTG 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 

CTCATTCGGCAGAAGATAATTTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTG 


2543 


Db 


1021 


1080 


Qy 


2544 


GCGG-ACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGC 

t t ■ 1 1 1 1 1 I 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

GCGGAACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGC 


2602 


Db 


1081 


1140 


Qy 


2603 


CCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACC 
, . , , t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 11 1 M 1 
CCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACC 


2662 


Db 


1141 


1200 


Ov 


2663 


TCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAG 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M M 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCAGCCAGCTGG-TGCAGCAGTGGCTGGACTGGGC— AGCAGACGCTGG-CTCTTCACAG 


2722 


Db 


1201 


1256 


Qy 


2723 


TGTCGGAGGCTGAGTGCTGAGGCCGG 274 8 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M 1 1 
TGTCGGAGGCTGAGTGCTGAGGCCGG 1282 




Db 


1257 





RESULT 6 
US-08-808-982-3 

Sequence 3, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu^ Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 



CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/808 , 982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2831 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-808-982-3 

Query Match 30.6%; Score 841.4; DB 2; Length 2831; 

Best Local Similarity 60.0%; Pred. No. 2.4e-172; , ,on q. 

Matches 1638; Conservative 0; Mismatches 961; Indels 130; Gaps 9, 

nv 143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 202 

III III II I I 1 I 1 I I I 1 I I I M I M I I I I I 1 I I I I M I I I I 

Db 104 



ACTCCTTCCCATCAGCACCCGCGGAGCAGCTGCCTCACTTCCTGCTGGAACCAGAGGATG 163 

:atcgtcaagaacaagccagtgctgcttgtgtgcaaggccgtgcccgccacgcaga 2 62 



QV 203 TGTAC 

^ llllllll IIMIIIIM I Ml IN_| I I_I1M 1^1 M I 

Db 164 



CCTACATCGTAAAGAACAAGCCAGTGGAATTGCACTGCCGAGCCTTCCCTGCCACACAGA 223 
TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 322 



lIl'llllMIM II 11 llllllll Mill llllllll II I.. 

224 TCTACTTCAAGTGTAATGGCGAGTGGGTTAGCCAGAAAGGCCACGTCACGCAGGAGAGCC 283 
Ov 323 CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 382 

II 1 I II M Ml I 

Db 284 



TGGATGAGGCCACAGGCTTGCGAATACGAGAGGTGCAGATAGAGGTGTCGCGGCAGCAGG 343 



Ov 383 TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 442 

^ 1 III I I II Mill Mill llllllll MMIIMIM IIMIMI MM 

Db 344 TGGAGGAACTTTTTGGGCTCGAGGACTACTGGTGTCAGTGCGTGGCCTGGAGCTCTTCGG 403 

Ov 443 GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 502 

^ 1 IMIMIIIIlll llllllllllllll III 'I'""" " ' 

Db 404 GAACCACCAAGAGTCGCCGAGCCTACATCCGCATTGCCTACTTGCGCAAGAACTTTGACC 463 



503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 562 



7 64 CCTGCTCTAACCGCTGCGGCCGAGGTTGGCAGAAACGTACTAGGACCTGCACCAACCCAG 823 
863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

III I I I I I I I 1 I 1 I I 1 'oo 

824 CCCCACTCAATGGAGGTGCCTTCTGCGAGGGACAGGCTTGCCAGAAGACGGCTTGCACCA 883 
923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 



1004 ACTGCAGCGGGACGCTACTTGACTCCAAGAACTGCACCGATGGGCTGTGCGTGCTGAATC 1063 

1103 CTTCTGGCC rt'^^tt M 1 1 1 M ^ I 

1064 AGAGAACTCTAAACGACCCTAAAAGCCGCCCCCTGGAGCCGTCGGGAGACGTGGCGCTGT 1123 

1130 ATGTGGGCCTC— ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCC 1186 

I I I I I I I I I I I I I I I I II I I I I I ' I " ' " ' 

1124 ATGCGGGCCTCGTGGTGGCCGTCTTTGTGGTTCTGGCAGTTCTCATGGCTGTAGGAGTGA 1183 

1187 TCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT— C 1243 

I I II I I 1 III II I II I I M I I I I I I M II i I 

1184 TCGTGTACCGGAGAAACTGCCGGGACTTCGACACGGACATCACTGACTCCTCTGCTGCCC 1243 

1244 TCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGC 1303 

I I I I 1 I II II I II I I I I I I I I I I I 1 I I I I I I I I I I M I 

1244 TCACTGGTGGTTTCCACCCCGTCAACTTCAAGACTGCAAGGCCCAGCAACCCACAGCTCC 1303 

1304 T CACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCT 1357 



1304 TGCACCCATCCGCCCCTCCGGACCTAACGGCCAGTGCTGGCATCTACCGCGGACCTGTGT 1363 

1358 GTCCCCGGCAGGA '^^ ^^''^ 

1364 ATGCCCTGCAGGACTCTGCCGACAAGATCCCTATGACTAATTCACCCCTTCTGGATCCCT 1423 

1373 GGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCC CCTGG 1423 

I I I I I I M I I I I I I I I I I I I I I I I I " " ' ..o 

1424 TGCCCAGCCTCAAGATCAAGGTCTATGACTCCAGCACCATCGGCTCTGGGGCTGGCCTGG 1483 

1424 GTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTT CGTCT 1483 

I I I I I I I III I I 11 I I I I I I I I M I 

1484 CTGATGGAGCCGACCTGCTGGGTGTCTTACCACCCGGTACATACCCAGGCGATTTCTCCC 1543 

1484 CCCGCCTCTCCACCCAGAAC TACTTCCGCTCCC 1516 

I I I II I II II III II 

1544 GGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCCTTGGTTCCCAGCACCTCCTGGGCC 1603 

1517 TGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGA 1576 

I II MM I I MM I II I Ml Mill I III II II mill 

1604 TCCCTCGAGACCCCAGCAGCAGTGTCAGTGGCACCTTTGGTTGCCTGGGTGGGAGGCTGA 1663 

1577 TGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGA 1636 

MM I I II I M I I I II I Ml 1 II I I I I I 

1664 CCATTCCCGGCACAGGGGTCAGCCTGTTGGTACCAAATGGAGCCATTCCCCAGGGCAAGT 1723 

1637 TCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCT 1696 

II II I I I I I I II I II I M I I II I I I III 

1724 TCTATGACTTGTATCTACGTATCAACAAGACTGAAAGCACCCTCCCACTTTCGGAAGGTT 1783 
1697 GTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGC 1756 

I I M I I M I I 1 M II I II I II I I I I I I I I II I I I MM 

1784 CCCAGACAGTATTGAGCCCCTCGGTGACCTGCGGGCCCACGGGCCTCCTCCTGTGCCGCC 1843 
1757 CAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCC 1816 

Ml MM II I I I I I II I II II II II II II II M I 

1844 CTGTTGTCCTCACTGTGCCCCACTGTGCTGAAGTCATTGCCGGAGACTGGATCTTCCAGC 1903 
1817 TCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGC 1876 

I I I I I MM Mill I II II I M II II II III M I M I M 

1904 TCAAGACCCAGGCCCATCAGGGCCACTGGGAGGAGGTGGTGACTTTGGATGAGGAGACTC 1963 

1877 CCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGC 1936 

I II I M II I M II I II II I II II I II M I II I I I 

1964 TGAACACCCCCTGCTACTGCCAGCTAGAGGCTAAATCCTGCCACATCCTGTTGGACCAGC 2023 

1937 TGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGC 1996 

MM II I I Ml III II I I MM IIIMI III Ml 

2024 TGGGTACCTACGTGTTCACGGGCGAGTCCTACTCCCGCTCCGCAGTCAAGCGGCTCCAGC 2083 

1997 TGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCC 2056 

I II II II I I II II I I II I II II I II II I I II II II II I I I 

2084 TAGCCATCTTCGCCCCAGCCCTCTGCACCTCCCTGGAGTATAGTCTCAGGGTCTACTGTC 214 3 

2057 TGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGAC 2116 



Db 



2144 TGGAGGACACTCCTGCAGCACTGAAGGAGGTCCTAGAGCTGGAGAGGACTCTGGGTGGCT 2203 



Ov 2117 AGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTAT 2176 

I II I II I UN I III III MM II II MM II I _ 

Db 2204 ACTTGGTGGAGGAGCCCAAGACTTTGCTCTTTAAGGACAGTTACCACAACCTACG-CTCT 2262 

Ov 2177 CCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGA 2236 

^ M MM M I III III MM III II M M I II Ml Ml MM 

Db 22 63 CCCTCCATGACATCCCCCATGCCCACTGGAGGAGCAAACTACTGGCCAAGTACCAGGAGA 2322 

Ov 2237 TCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGG 22 96 

^ I Mill II M I Mill MM II Illllll IIIIIIIIM 

Db 2323 TTCCCTTCTACCATGTGTGGAACGGCAGCCAGAAAGCCCTGCACTGCACTTTCACCCTGG 2382 

Ov 2297 AGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGG 2356 

III MM I I I 1 I I I M I I I I I I M I M M I II II I I I 

Db 2383 AGAGACATAGCCTAGCCTCCACTGAGTTCACCTGTAAGGTCTGCGTGCGGCAGGTAGAAG 2442 



QV 2357 GCGACGGGCAGAGCTTCAGCATCAACTTCAACATCAC— CAAGGACACAAGGTTTGCTG 2413 

I 1! II I II I II III I 

Db 2443 GGG; 



Qy 



5AAGGCCAGATTTTCCAGCTGCACACCACGCTGGCTGAGACGCCTGCTGGCTCCCTGG 2502 



Ov 2414 AGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCA 2473 

^ I II I II 1 I I 1 I I I I I I I I I I I I It M M 

ATGCACTCTGCTCTGCCCCTGGCAATGCTGCCACCACACAGCTGGGACCCTATGCCTTCA 



Db 2503 



2562 

2533 



Ov 2474 AGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGG 

MM II I M t I Mil I II MM 

Db 2563 AGATACCACTGTCCATCCGCCAGAAGATCTGCAACAGCCTGGACGCCCCCAACTCACGGG 2622 



2534 GTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTG 2593 

I II II M II I II M I M II III I I M M I IIIIIIIIM 

Db 2623 GCAATGACTGGCGGCTGTTGGCACAGAAGCTCTCCATGGACCGGTACCTGAACTACTTCG 



Ov 2594 CCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCA 

II MM I II Illllll t mil I MM MIII_II__IMII 
Db 2683 CCAO 



2682 

2653 



CAAAGCTAGTCCCACAGGCGTGATCTTAGACCTCTGGGAAGCTCGGCAGCAGGATG 2742 



Ov 2654 ACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCC 2713 

III Illllll Mill II Ml I Illllll M 

ATGGGGACCTCAACAGCCTGGCCAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGG 2802 



Db 2743 



Qy 2714 TCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I M I II I M II II I I 

Db 2803 TAGCCATGACCACTGATGGCGATTGCTGA 2831 



RESULT 7 

US-09-306-902A-3 

; Sequence 3, Application US/09306902A 

; Patent No. 6277585 

; GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
Leonardo, E. David 
Hink, Lindsay 

. Masu, Masayuki 

Kazuko, Keino-Masu 



TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/306, 902A 
FILING DATE: 07-May-1999 
CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2831 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-09-306-902A-3 



Query Match 30.6%; 
Best Local Similarity 60.0%; 
Matches 1638; Conservative 



Score 841.4; DB 3 
Pred. No. 2.4e-172 
0; Mismatches 961 



Length 2831; 

Indels 130; Gaps 9; 



Qy 


143 


Db 


104 


Qy 


203 


Db 


164 


Qy 


263 


Db 


224 


Qy 


323 


Db 


284 


Qy 


383 



III III 



1 1 



I I I I I I III II I I I I I I I M I I I I II I I I I M 



TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 2 62 

I I I I I M Ill I III I I I I I I I I I I I I I I I 

CCTACATCGTAAAGAACAAGCCAGTGGAATTGCACTGCCGAGCCTTCCCTGCCACACAGA 223 

TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 322 

I I I M I I I I M 1 I I 1 I I I I I I I I I Mill I I M I I I I II III 
TCTACTTCAAGTGTAATGGCGAGTGGGTTAGCCAGAAAGGCCACGTCACGCAGGAGAGCC 283 



I I I I II II III I 



I I I I I I II III 



I I I I I I 



I III I I II I I I M mil I I I I M I I I I I I I I I I I M I I I I I I I I 11 



34 4 TGGAGGAACTTTTTGGGCTCGAGGACTACTGGTGTCAGTGCGTGGCCTGGAGCTCTTCGG 403 
443 GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 502 

I MIIIMIIIIII llllllllllllll III IIMIIIIIIMM II I 

404 GAACCACCAAGAGTCGCCGAGCCTACATCCGCATTGCCTACTTGCGCAAGAACTTTGACC 463 

503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 562 

I I M I I I I I I I I I I I I I M I I I II I I II I I I I M I I M I II I I I I 
4 64 AGGAGCCTCTGGCGAAGGAGGTACCCTTGGATCATGAGGTCCTTCTGCAGTGCCGCCCAC 523 

563 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 622 

I I I I I I MM I I M I I I I M I I I I I I I I 

524 CAGAGGGAGTGCCTGTGGCTGAGGTGGAATGGCTCAAGAATGAAGATGTCATCGATCCCG 583 

623 CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 682 

I I I I I I I I I I Mil I I M M II I I I II M II I I I II 

584 CTCAGGACACTAACTTCCTGCTCACCATTGACCACAACCTCATCATCCGCCAGGCGCGCC 643 
683 TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 742 

I I M I M I M II II I I M I II I II I I I II M I I I II I I II M I I M 

644 TCTCAGACACAGCCAACTACACCTGTGTGGCAAAGAATATTGTGGCCAAGCGCCGGAGCA 703 
743 CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

I Ml M M I II II II II II M II II M I II II II II II II I I I 

704 CGACGGCCACAGTCATCGTCTATGTGAACGGAGGTTGGTCCAGCTGGGCAGAATGGTCAC 763 
803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 862 

MM I MM II II II II I M I II I II I III II I M M II I I I I 

764 CCTGCTCTAACCGCTGCGGCCGAGGTTGGCAGAAACGTACTAGGACCTGCACCAACCCAG 823 
863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

M M I M II II II II II M II M M II I II II II M II M II M 

824 CCCCACTCAATGGAGGTGCCTTCTGCGAGGGACAGGCTTGCCAGAAGACGGCTTGCACCA 883 
923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 

M llllllllll II M MM IIIIMIIIIMIIIIIIM I I 

884 CCGTGTGCCCAGTGGATGGAGCGTGGACTGAGTGGAGCAAGTGGTCCGCCTGCAGCACAG 943 
983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

Ml II II II II I II M M II I M I II Mill I II II M I I 

944 AGTGTGCGCACTGGCGCAGCCGCGAGTGCATGGCACCGCCGCCCCAGAACGGAGGCCGTG 1003 
1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

MM MM M I II II II M II M II II I M M II 

1004 ACTGCAGCGGGACGCTACTTGACTCCAAGAACTGCACCGATGGGCTGTGCGTGCTGAATC 1063 

1103 CTTCTGGCC CTGAGGACGTGGCCCTCT 112 9 

I MIIIMIIIIII 

1064 AGAGAACTCTAAACGACCCTAAAAGCCGCCCCCTGGAGCCGTCGGGAGACGTGGCGCTGT 1123 
1130 ATGTGGGCCTC— -ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCC 118 6 

M M I II M I II M M II II I I III II III I 
1124 ATGCGGGCCTCGTGGTGGCCGTCTTTGTGGTTCTGGCAGTTCTCATGGCTGTAGGAGTGA 1183 

1187 TCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT C 1243 

I M II I I III II I I I I I I I I M II M M I I I 

1184 TCGTGTACCGGAGAAACTGCCGGGACTTCGACACGGACATCACTGACTCCTCTGCTGCCC 1243 



1244 TCACCTCAGGCTTCCAGCCCGTCAGCATC7\AGCCCAGCAAAGCAGACAACCCCCATCTGC 1303 

MM M M M I M M M I I I I I M I I I M M I I I I I I I 

1244 TCACTGGTGGTTTCCACCCCGTCAACTTCAAGACTGCj^AGGCCCAGCAACCCACAGCTCC 1303 
1304 T CACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCT 1357 

I M M I M M M I I III I I I I I I I I II III 

1304 TGCACCCATCCGCCCCTCCGGACCTAACGGCCAGTGCTGGCATCTACCGCGGACCTGTGT 1363 

1358 GTCCCCGGCAGGA TG 1372 

I I I I II II II 

1364 ATGCCCTGCAGGACTCTGCCGACAAGATCCCTATGACTAATTCACCCCTTCTGGATCCCT 1423 

1373 GGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCC CCTGG 1423 

M II M II I I I I II II I I I I I I I I I Mill 

1424 TGCCCAGCCTCAAGATCTiAGGTCTATGACTCCAGCACCATCGGCTCTGGGGCTGGCCTGG 14 83 

1424 GTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCT 1483 

MM III Ml I I I II I I I I I I I II I 

1484 CTGATGGAGCCGACCTGCTGGGTGTCTTACCACCCGGTACATACCCAGGCGATTTCTCCC 1543 

1484 CCCGCCTCTCCACCCAGAAC TACTTCCGCTCCC 1516 

I I I II I II 

1544 GGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCCTTGGTTCCCAGCACCTCCTGGGCC 1603 

1517 TGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGA 1576 

I M MM I IIIIM II I III MM! I Ml II II MMII 

1604 TCCCTCGAGACCCCAGCAGCAGTGTCAGTGGCACCTTTGGTTGCCTGGGTGGGAGGCTGA 1663 

1577 TGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGA 1636 

M II Mill lllllll I I II I Mill II 

1664 CCATTCCCGGCACAGGGGTCAGCCTGTTGGTACCAAATGGAGCCATTCCCCAGGGCAAGT 1723 

1637 TCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCT 1696 

M M I II I I I I I I I M M I II I I I I M I 

1724 TCTATGACTTGTATCTACGTATCAACAAGACTGAAAGCACCCTCCCACTTTCGGAAGGTT 1783 

1697 GTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGC 1756 

Mill I lllllll II I M I II II I I II I I M I II Ml I 
1784 CCCAGACAGTATTGAGCCCCTCGGTGACCTGCGGGCCCACGGGCCTCCTCCTGTGCCGCC 1843 

1757 CAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCC 1816 

Ml MM MM II M I II I II II I I II II I I I I I 

1844 CTGTTGTCCTCACTGTGCCCCACTGTGCTGAAGTCATTGCCGGAGACTGGATCTTCCAGC 1903 

1817 TCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGC 187 6 

Mill II I I Mill II II I M M III II III I II I II I I 

1904 TCAAGACCCAGGCCCATCAGGGCCACTGGGAGGAGGTGGTGACTTTGGATGAGGAGACTC 1963 

1877 CCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGC 1936 

I II I I I I M II I M II I I I M I I Mill II II I MUM 

1964 TGAACACCCCCTGCTACTGCCAGCTAGAGGCTAAATCCTGCCACATCCTGTTGGACCAGC 2023 

1937 TGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGC 1996 

MM II I I I II M I II I I I II I I M 

2024 TGGGTACCTACGTGTTCACGGGCGAGTCCTACTCCCGCTCCGCAGTCAAGCGGCTCCAGC 



2083 



Qy 


1997 


TGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCC 

1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

TAGCCATCTTCGCCCCAGCCCTCTGCACCTCCCTGGAGTATAGTCTCAGGGTCTACTGTC 


2056 


Db 


2084 


2143 


Qy 


2057 


TGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGAC 

M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 
TGGAGGACACTCCTGCAGCACTGAAGGAGGTCCTAGAGCTGGAGAGGACTCTGGGTGGCT 


2116 


Db 


2144 


2203 


Qy 


2117 


AGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTAT 

1 III II 1 II 1 1 1 II 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ACTTGGTGGAGGAGCCCAAGACTTTGCTCTTTAAGGACAGTTACCACAACCTACG-CTCT 


2176 


Db 


2204 


2262 


Qy 


2177 


CCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGA 

1 1 1 1 M II 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 
CCCTCCATGACATCCCCCATGCCCACTGGAGGAGCAAACTACTGGCCAAGTACCAGGAGA 


2236 


Db 


2263 


2322 


Qy 


2237 


TCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGG 

1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 1 1 1 1 1 M 

TTCCCTTCTACCATGTGTGGAACGGCAGCCAGAAAGCCCTGCACTGCACTTTCACCCTGG 


2296 


Db 


2323 


2382 


Qy 


2297 


AGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGG 

III 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 II 1 1 1 M I 1 1 1 1 1 1 1 II 1 
AGAGACATAGCCTAGCCTCCACTGAGTTCACCTGTAAGGTCTGCGTGCGGCAGGTAGAAG 


2356 


Db 


2383 


2442 


Qy 


2357 


GCGACGGGCAGAGCT T CAGCAT CAACTT CAACAT CAC- - - CAAGGAC ACAAGGTTT GCTG 

II II II 1 1 1 1 M II II III 1 

GGGAAGGCCAGATTTTCCAGCTGCACACCACGCTGGCTGAGACGCCTGCTGGCTCCCTGG 


2413 


Db 


2443 


2502 


Qy 


2414 


AGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCA 

1 II 1 II 1 III 1 1 1 1 1 1 1 1 11 M 1 1 1 1 1 1 
ATGCACTCTGCTCTGCCCCTGGCAATGCTGCCACCACACAGCTGGGACCCTATGCCTTCA 


2473 


Db 


2503 


2562 


Qy 


2474 


AGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTAGGCGGG 

1 1 1 1 1 1 1 1 II II II II 1 1 II 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 
AGATACCACTGTCCATCCGCCAGAAGATCTGCAACAGCCTGGACGCCCCCAACTCACGGG 


2533 


Db 


2563 


2622 


Qy 


2534 


GTGCCGACTGGCGGACTCTGGCCCAGATU^CTCCACCTGGACAGCCATCTCAGCTTCTTTG 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GC/JJi.TGACTGGCGGCTGTTGGCACAGAAGCTCTCCATGGACCGGTACCTGAACTACTTCG 


2593 


Db 


2623 


2682 


Qy 


2594 


CCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCA 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 
CCACCAAAGCTAGTCCCACAGGCGTGATCTTAGACCTCTGGGAAGCTCGGCAGCAGGATG 


2653 


Db 


2683 


2742 


Qy 


2654 


ACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCC 

III 1 1 1 1 1 1 1 1 1 1 1 1 II 111 1 1 1 1 1 1 1 1 II 
ATGGGGACCTCAACAGCCTGGCCAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGG 


2713 


Db 


2743 


2802 


Qy 


2714 


TCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

1 II MM 1 II 1 1 1 1 1 
TAGCCATGACCACTGATGGCGATTGCTGA 2831 




Db 


2803 





RESULT 8 

US-09-833-381-1807 

; Sequence 1807, Application US/09833381 
; Patent No. 6672186 



GENERAL INFORMATION: 
APPLICANT: Robison, Keith E. 

TITLE OF INVENTION: No. 667218 6el Nucleic Acid and Protein Homologs 
FILE REFERENCE: 5800-119 

CURRENT APPLICATION NUMBER: US/09/833, 381 
CURRENT FILING DATE: 2001-04-11 
PRIOR APPLICATION NUMBER: 09/516,448 
PRIOR FILING DATE: 2000-02-29 
NUMBER OF SEQ ID NOS : 2050 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 1807 
LENGTH: 1605 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/ KEY : mis c__f eature 
LOCATION: (1) . . . (1605) 
OTHER INFORMATION: n = A,T,C or G 
US-09-833-381-1807 

Query Match 15.7%; Score 432; DB 4; Length 1605; 

Best Local Similarity 62.5%; Pred. No. 4e-84; 

Matches 737; Conservative 0; Mismatches 435; Indels 7; Gaps 4; 

Qy 1515 CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCT 1574 

I I I I I I I I MM I I I M I I I Mill Ml 

Db 421 CCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCTTTGGCTGCCTGGGTGGGAGGCT 4 80 

Qy 1575 GATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAA 1634 

I MMI MMI MMI I M I Ml I MMI MM II II 

Db 481 CANCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCAATGGAGCCATTCCCCAGGGCAA 540 

Qy 1635 GATCTATGA— GATCTACCTCACGCTGCACi^AGCCGGAAGACGTGAGGTTGCCCCTAGCT 1692 

I M M M II I M I M I I M II II 

Db 541 GTTCTACGAAGATGTATTCTACTCATCAACAAGGCAGAAAGTACCCTCCCGCTTTCAGAA 600 

Qy 1693 GGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACC 1752 

II MM I M II II I II I II II II II M I II I M M I M I 

Db 601 GGGACCCANACAGTATTGAGCCCCTCGGTGACCTGTGGACCCACAGGCCTCCTGCTGTGC 660 

Qy 1753 CGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTG 1812 

I I M II II M II I II I II II I I II II III I M M I I I 

Db 661 CGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCAGTGCCCGTGACTGGATCTTT 720 

Qy 1813 CGCCTCAAA7^AGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAG 1872 

I Mini MM MUM I I M II I II II II MMI I M M I 
Db 721 CAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGANGAGGTGGTGACCCTGGATGAGGAG 780 

Qy 1873 GCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAG 1932 

II I II II II II II I II I I M M I I II II I II MMI II 

Db 781 ACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGGCCTGTCACATCCTGCTGGAC 840 

Qy 1933 CAGCTGGGCCGCT-TTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCT 1991 

M II M II I II I I I I II M II I M I M II II II I 

Db 8 41 CAGCTGGGCACCTACCGTGTTCACGGGCGAGTCCTATTCCCGCTCAGCAGTCAAGCGGCT 900 



Qy 



1992 CAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTA 2 051 



I Mill I II II II I IIIIIIIMMI IMIIII I IIIIIIMII 

Db 901 CCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCCTCCGGGTCTA 960 

Qy 2 052 CTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGG 2111 

II I I I M I I I M I II II II I M I I I I I I I I M I I I II II I 

Db 961 CTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGCGGACTCTGGG 1020 

Qy 2112 GGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCG 2171 

III I Ml IIIMM II IIIIIIIIIIIIIIIIMIIIIIIM 

Db 1021 CGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGGACAGTTACCACAACCTGCG 1080 

Qy 2172 CCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCA 2231 

III III IN! I Ill 111 II IN Mill 

Db 1081 CCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCAAGCTGCTGGCCAAATACCA 1140 

Qy 2232 GGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCAC 2291 

I I I I II II II II M II II M II II I II II III I I I II II II M II M I 
Db 1141 GGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGGCCCTCCACTGCACTTTCAC 1200 

Qy 22 92 CCTGGAGCGTGTCAGCCCCAG-CACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGG 2350 

I II II M I II I I I I I I MM M M M II I I II I I 

Db 1201 CCTGGAGAGGCACAGGCTTGGCCTCCACAGAGCTCACCTGCAAGATCTGCGTGCGGCAAG 1260 

Qy 2351 TGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAG— GACACAAGGT 2407 

I I M II II II M M III I I II I I M I II 

Db 1261 TGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGGCAGAGACACCTGCTGGCT 1320 

Qy 2408 TTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTG 2467 

II II I M M I I I II I I M I II II 

Db 1321 CCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCACCCAGCTGGGACCTTATG 1380 

Qy 2468 CCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCACCCTGTA 2527 

I I I II I I I M I I I I I M II I I I II II I I II II II M I I II 

Db 1381 CCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACAGCCTAGATGCCCCCAACT 1440 

Qy 2528 GGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCT 2587 

Mill 11 I 11 I I I II II I I M I II II I MM I 

Db 1441 CACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTATGGACCGGTACCTGAATT 1500 

Qy 2588 TCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACT 2647 

I I I II I II II I II I II M M I IMIIII II I II II II M II M 

Db 1501 ACTTTGCCACCAAAGCGAGCCCCACGGNTGTGATCCTGGACCTCTGGGAAGCTCTGCAGC 1560 

Qy 2648 TCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGG 2686 

MM II II I II MM M III 

Db 1561 AGGACGATGGGGACCTCAACAGCCTGNCGAGTGCCTTGG 1599 



RESULT 9 

US-08-253-155A-17/C 

; Sequence 17, Application US/08253155A 

; Patent No. 5691147 

; GENERAL INFORMATION: 

; APPLICANT: Gyuris, Jeno 

; APPLICANT: Draetta, Giulio 

TITLE OF INVENTION: CDK4 Binding Proteins 
; NUMBER OF SEQUENCES: 95 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD 
STREET: 60 State Street 
CITY: Boston 
STATE: MA 
COUNTRY: USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: ASCII (text) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/253 , 155A 
FILING DATE: 02-JUN-1994 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Vincent^ Matthew P. 
REGISTRATION NUMBER: 36,709 
REFERENCE/ DOCKET NUMBER: MI I- 02 8 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 227-7400 
TELEFAX: (617) 227-5941 
INFORMATION FOR SEQ ID NO: 17: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 771 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-253-155A-17 

Query Match 9.8%; Score 2 69; DB 1; Length 771; 

Best Local Similarity 67.6%; Pred. No. 4.6e-49; 

Matches 406; Conservative 1; Mismatches 191; Indels 3; Gaps 2; 

Qv 1700 AGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAG 1759 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

7 38 AGACAGTATTGAGCCCCTCGGTGACCTGTGGACCCACAGGCCTCCTGCTGTGCCGCCCCG 679 

Qy 1760 TCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCA 1819 

IIIMM I IN Ill II Ml I Mill I I 

67 8 TCATCCTCACCATGCCCCACTGTGCCGAAGTCAGTGCCCGTGACTGGATCTTTCAGCTCA 619 
Qv 1820 AAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCT 1879 

II II I I I I M M M I II M II II I II I II I I II I II I I I 

Db 618 AGACCCAGGCCCACCAGGGCCACTGGGAGGAGGTGGTGACCCTGGATGAGGAGACCCTGA 559 

Qv 18 8 0 CCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGG 1939 

I I II M I II I M II I M II I I M I I I II I II I M I I II 

Db 558 ACACACCCTGCTACTGCCAGCTGGAGCCCAGGGCCTGTCACATCCTGCTGGACCAGCTGG 499 

Ov 1940 GCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGC 1999 

11 M 1 I Ml Ml M I I II I IMIII Ml Mill 

Db 498 GCACCTACGTGTTCACGGGCGAGTCCTATTCCCGCTCAGCAGTCAAGCGGSTCCAGCTGG 439 



Qy 



2000 TTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGC 2059 



I II II II I 1 1 1 1 II II 1 1 II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

Db 438 CCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGACTCCGGGTCTACTGTCTGG 379 

Qy 2060 ATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGC 2119 

Mill I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I M I 

Db 37 8 AGGACAGGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGCGGACTCTGGGCGGATACT 319 

Qy 2120 TGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCA 2179 

Ml I II I I II II I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

Db 318 TGGTGGAGGAGCCGAAACCGCTAATGTTCAAGGACAGTTAACAC/U^CCTGCG-CTCTTCC 260 

Qy 218 0 TCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCC 2239 

MUM MM III II I I I I I I M II I I I I I I I I I I I I I I I I I I 

Db 259 TCCATGACCTCCCCCATGCCCATTGGAGGAGCAAGCTGCTGGCCAAATACCAGGAGATCC 200 

Qy 2240 CCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGC 2299 

II II I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 199 CCTTCTATCACATTTGGAGTGGAGCCAGAAGG— CCTCCACTGCACTTTCACCCTGGAGA 142 

Qy 2300 G 2300 

I 

Db 141 G 141 



RESULT 10 
US-08-808-982-4 

Sequence 4, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo^ E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: • 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808, 982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 



; TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 305 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
; MOLECULE TYPE: cDNA 
US-08-808-982-4 



Query Match 4.3%; Score 119; DB 2; Length 305; 

Best Local Similarity 68.1%; Pred. No. 7.8e-17; 

Matches 209; Conservative 0; Mismatches 95; Indels 3; Gaps 3; 



Qy 


1862 


TGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACG 
111 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 

TGGATGAGGAGACCCTGAACACACCCl CjC i AC i Lr— i ± »j 


1921 


Db 


1 


SR 


Qy 


1922 


TCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCG 

III 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCCTGCTGGACCAGCTGGGCACCTACGTTTTCACGGGCGAGTCCTATTCCCGCTCAGCAG 


1981 


Db 


59 


118 


Qy 


1982 


CCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACT'iACA 

1 1 II II III 1 1 II 1 1 M II II 1 M 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 
TCAAGCGGCTCCAGCTGGCCGT-TTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCC 


2041 


Db 


119 


177 


Qy 


2042 


TCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGA 

1 1 1 1 II 1 1 M 1 II 1 1 1 1 1 II 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 

TCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGC 


2101 


Db 


178 


237 


Qy 


2102 


AGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACC 

1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GGACTCTGGGCGGATACTTGGTGGAGGAGCCGAT^CCGCTAATGTTCAAGGACAGTTACC 


2161 


Db 


238 


297 


Qy 


2162 


ACAACCT 2168 

1 1 1 1 1 1 1 
ACAACCT 304 




Db 


298 





RESULT 11 
US-09-306-902A-4 

; Sequence 4, Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink^ Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

; TITLE OF INVENTION: Netrin Receptors 

; NUMBER OF SEQUENCES: 9 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 
; CITY: SAN FRANCISCO 

; STATE: CALIFORNIA 



COUNTRY: USA 

ZIP: 94104 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWT^RE: PatentIn Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 
; FILING DATE: 07-May-1999 

; CLASSIFICATION: <Unknown> 

ATTORNEY/AGENT INFORMATION: 
; NAME: OSMAN, RICHARD A 

; REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 305 base pairs 

; TYPE: nucleic acid 

; STRANDEDNESS: double 

; TOPOLOGY: linear 

; MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
US-09-306-902A-4 



Query Match 4.3%; Score 119; DB 3; Length 305; 

Best Local Similarity 68.1%; Pred. No. 7.8e-17; 

Matches 209; Conservative 0; Mismatches 95; Indels 3; Gaps 3; 



Qy 


1862 


TGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACG 

III 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 
TGGATGAGGAGACCCTGAACACACCCTGCTACTG-CAGCTGGAGCCCAGGGCCTG-TACA 


1921 


Db 


1 


58 


Qy 


1922 


TCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCG 

ill 1 1 1 M 1 1 1 1 1 1 II 1 1 Ml 1 1 1 1 1 1 1 1 1 1 

TCCTGCTGGACCAGCTGGGCACCTACGTTTTCACGGGCGAGTCCTATTCCCGCTCAGCAG 


1981 


Db 


59 


118 


Qy 


1982 


CCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACA 

1 1 1 1 II III 1 1 II 1 1 M II II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
TCAAGCGGCTCCAGCTGGCCGT-TTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCC 


2041 


Db 


119 


177 


Qy 


2042 


TCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGA 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 II 
TCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGC 


2101 


Db 


178 


237 


Qy 

Db 


2102 
238 


AGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACC 

1 1 1 1 II II 1 1 III 1 1 1 1 1 1 1 II 1 1 II M 1 1 

GGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGGACAGTTACC 


2161 
297 


Qy 


2162 


ACAACCT 2168 

M 1 1 1 1 1 

ACAACCT 304 




Db 


298 





RESULT 12 
US-08-985-526-2 

Sequence 2, Application US/08985526 
Patent No. 6080728 
GENERAL INFORMATION: 

APPLICANT: Mixson, James A 

TITLE OF INVENTION: CARRIER: DNA COMPLEXES CONTAINING DNA 
TITLE OF INVENTION: ENCODING ANT I -ANGIOGENIC PEPTIDES AND THEIR USE IN 
GENE 

TITLE OF INVENTION: THERAPY 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Connolly, Bove, Lodge, & Hutz 
STREET: 1220 Market Street, P.O. Box 2207 
CITY: Wilmington 
STATE: Delaware 
COUNTRY: U.S.A. 
ZIP: 19899 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/985, 526 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/608,845 
FILING DATE: 16-JUL~1996 
ATTORNEY/AGENT INFORMATION: 

NAME: McMorrow Jr., Robert G 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (302) 658-9141 
TELEFAX: (302) 658-5613 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 657 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
US-08-985-526-2 

Query Match 1.9%; Score 53.6; DB 3; Length 657; 

Best Local Similarity 52.7%; Pred. No, 0.012; 

Matches 116; Conservative 0; Mismatches 104; Indels 0; Gaps 0; 

Qy 756 CATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAG 815 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I IN III 

Db 222 CAGCGACTCTGCGGACGATGGCTGGTCTCCATGGTCCGAGTGGACCTCCTGTTCTACGAG 281 

Qy 816 CTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGG 875 

II M II II M I I M M I I Mil III I I 

Db 282 CTGTGGCAATGGAATTCAGCAGCGCGGCCGCTCCTGCGATAGCCTCAACAACCGATGTGA 341 

Qy 876 GGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGT 935 

I II I I I II III I I I I I I I I I I I I 



Db 342 GGGCTCCTCGGTCCAGACACGGACCTGCCACATTCAGGAGTGTGACAAAAGATTTAAACA 401 



Qy 936 AGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGT 975 

I I I I I M I I I I I I III I II II M I I I I 
Db 402 GGATGGTGGCTGGAGCCACTGGTCCCCGTGGTCATCTTGT 441 



RESULT 13 
US-08-985-526-4 

Sequence 4, Application US/08985526 
Patent No. 6080728 
GENERAL INFORMATION: 

APPLICANT: Mixson, James A 

TITLE OF INVENTION: CARRIER: DNA COMPLEXES CONTAINING DNA 
TITLE OF INVENTION: ENCODING ANT I -ANGIOGENIC PEPTIDES AND THEIR USE IN 
GENE 

TITLE OF INVENTION: THERAPY 
NUMBER OF SEQUENCES: 43 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Connolly, Bove, Lodge, & Hutz 
STREET: 1220 Market Street, P.O. Box 2207 
CITY: Wilmington 
STATE: Delaware 
COUNTRY: U.S.A. 
ZIP: 19899 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentin Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/985,526 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/608,845 
FILING DATE: 16-JUL-1996 
ATTORNEY/AGENT INFORMATION : 

NAME: McMorrow Jr., Robert G 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (302) 658-9141 
TELEFAX: (302) 658-5613 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1326 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
US-08-985-526-4 

Query Match 1.9%; Score 53.6; DB 3; Length 1326; 

Best Local Similarity 52.7%; Pred. No. 0.015; 

Matches 116; Conservative 0; Mismatches 104; Indels 0; Gaps 0; 

Qy 756 CATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAG 815 

M I I I I I I I I I M I I I I I I I I I I I I M I I I I I I Ml III 

Db 222 CAGCGACTCTGCGGACGATGGCTGGTCTCCATGGTCCGAGTGGACCTCCTGTTCTACGAG 281 



Qy 816 CTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGG 875 

I I I I I I II I I I I I I I I I I I I I I III I I 

Db 2 82 CTGTGGC7\ATGGAATTCAGCAGCGCGGCCGCTCCTGCGATAGCCTCAACAACCGATGTGA 341 

Qy 876 GGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGT 935 

II I I I I II III MM I I I I I I I I 

Db 342 GGGCTCCTCGGTCCAGACACGGACCTGCCACATTCAGGAGTGTGACAAAAGATTTAAACA 401 

Qy 936 AGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGT 975 

I I I I I I I I M M I III I MINI I I II 

Db 402 GGATGGTGGCTGGAGCCACTGGTCCCCGTGGTCATCTTGT 441 



RESULT 14 

US-09-252-991A-7947 

Sequence 7947, Application US/09252991A 
Patent No. 6551795 
GENERAL INFORMATION: 
APPLICANT: Marc J. Rubenfield et al . 

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 

FILE REFERENCE: 107196.136 

CURRENT APPLICATION NUMBER: US/09/252, 991A 
CURRENT FILING DATE: 1999-02-18 
PRIOR APPLICATION NUMBER: US 60/074,788 
PRIOR FILING DATE: 1998-02-18 
PRIOR APPLICATION NUMBER: US 60/094,190 
PRIOR FILING DATE: 1998-07-27 
NUMBER OF SEQ ID NOS : 33142 
SEQ ID NO 7947 
LENGTH: 699 
TYPE: DNA 

ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-7947 

Query Match 1.9%; Score 52.8; DB 4; Length 699; 

Best Local Similarity 47.6%; Pred. No. 0.019; 

Matches 156; Conservative 0; Mismatches 172; Indels 0; Gaps 0; 
Qy 1824 GCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCA 1883 

II I M I II I I I I I I I II I I I I II I III M 

Db 160 GCCGAAGTACGACGTCTACGACAGCAACGAGGTGCTCGAGGCCAAGCTGCTGTCCGGGCA 219 

Qy 1884 CCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCG 1943 

I MM II I M I I M I I I I I M II I I I I I I III 

Db 220 TTCCGGCTACGACCTGGTGGTGCCCAGCGACAGCTTCCTGCCCAACTACCTGAAGGCCGA 279 

Qy 1944 CTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCT 2003 

I II I I II M III I II I M II M I I II 

Db 28 0 GGTGTTCCAGCCGCTGGACAAGAGCAAGCTGCCGAACTGGAAGAACCTCAACCCGGCCCT 339 

Qy 2004 GTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGA 2063 

II I I I I M II I II II M I I II Mill I 

Db 34 0 GCTCAAGGTGCTCGCCGGCAAGGACCCCGGCAACCGCTACGTGATGCCCTACATGTGGGG 399 



Qy 2064 CACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGAT 2123 

M I I I I I I I I II I I 

Db 400 CACCAACGGCATCGCCTACAACCTCGACAAGGTCCGCGCGGTGCTCGGCGACGATGCGCC 459 

Qy 2124 CCAGGAGCCACGGGTCCTGCACTTCAAG 2151 

I I II I M I II I I I I I I I I 
Db 460 GCTGGACTCCTGGGACCTGGTGTTCAAG 4 87 



RESULT 15 

US-09-252-991A-78 09 

; Sequence 7809, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

APPLICANT: Marc J. Rubenfield et al . 
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 

; CURRENT APPLICATION NUMBER: US/ 09/2 52 , 991A 
; CURRENT FILING DATE: 1999-02-18 
; PRIOR APPLICATION NUMBER: US 60/074,788 
; PRIOR FILING DATE: 1998-02-18 
; PRIOR APPLICATION NUMBER: US 60/094,190 
; PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 7 8 09 
; LENGTH: 1302 
TYPE: DNA 

ORGANISM: Pseudomonas aeruginosa 
US-09-252-9 91A-7 809 



Query Match 1.9%; Score 52.8; DB 4; Length 1302; 

Best Local Similarity 47.6%; Pred. No. 0.022; 

Matches 156; Conservative 0; Mismatches 172; Indels 0; Gaps 



Qy 


1824 


GCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCA 

III II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II 
GCCGAAGTACGACGTCTACGACAGCAACGAGGTGCTCGAGGCCAAGCTGCTGTCCGGGCA 


1883 


Db 


360 


419 


Qy 


1884 


CCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCG 

1 II 1 1 II 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
TTCCGGCTACGACCTGGTGGTGCCCAGCGACAGCTTCCTGCCCAACTACCTGAAGGCCGA 


1943 


Db 


420 


479 


Qy 


1944 


CTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCT 

1 1 1 1 1 II II III 1 1 1 1 1 1 1 II 1 1 1 II 
GGTGTTCCAGCCGCTGGACAAGAGCAAGCTGCCGAACTGGAAGAACCTCAACCCGGCCCT 


2003 


Db 


480 


539 


Qy 


2004 


GTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGA 

II 1 1 1 M 1 1 1 1 MM M 1 1 1 1 II 1 1 1 1 
GCTCAAGGTGCTCGCCGGCAAGGACCCCGGCAACCGCTACGTGATGCCCTACATGTGGGG 


2063 


Db 


540 


599 


Qy 


2064 


CACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGAT 

1 II 1 1 M 1 1 1 1 1 II 1 1 1 II II 1 1 1 
CACCAACGGCATCGCCTACAACCTCGACAAGGTCCGCGCGGTGCTCGGCGACGATGCGCC 


2123 


Db 


600 


659 


Qy 


2124 


CCAGGAGCCACGGGTCCTGCACTTCAAG 2151 





I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 660 GCTGGACTCCTGGGACCTGGTGTTCAAG 687 



Search completed: July 12, 2004, 20:01:20 
Job time : 199 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: July 12, 2004, 18:05:13 ; Search time 1173 Seconds 

(without alignments) 
11434.720 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10~624-932-l 
2752 

1 ccgcggggccccgcgcccgg tgagtgctgaggccggccag 2752 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



3183909 seqs, 2436941669 residues 



Total number of hits satisfying chosen parameters; 



6367818 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA: * 

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:* 
2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB. seq: * 
3 : / cgn2_6/ptodata/2 /pubpna/US 0 6__NEW_PUB .seq:* 
4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:* 
5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 
6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:* 
7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq: ^ 
8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq: * 
9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB,seq:* 
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:* 
13 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq2 : * 
14 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 
15: /cgn2_6/ptodata/2/pubpna/USl0B_PUBCOMB.seq:* 
16: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:* 
17: /cgn2_6/ptodata/2/pubpna/USl0_NEW_PUB.seq:* 
18 : /cgn2__6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
19: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



No. Score Match Length DB ID 



Description 
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9 
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Sequence 
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13 


936.2 
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13 


us- 


-10- 
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Sequence 


3, Appli 


14 
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34 . 


0 


2860 


13 


us- 


■10- 
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Sequence 


1, Appli 


15 
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34. 


0 


2860 


13 


us- 


-10- 


-218- 


-779-3 


Sequence 


3, Appli 


16 
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2 
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13 


us- 
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-417-37 


Sequence 


37, Appl 


17 
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32 . 


8 
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9 
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Sequence 18, Appl 
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32 . 


8 
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Sequence 


145, App 
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32 . 


8 
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13 
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-10- 
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Sequence 


145, App 
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3884 


13 


us- 


-10- 


-160- 


-503-145 


Sequence 


145, App 


21 


902.4 


32 . 


, 8 


3884 


13 


us- 


-10- 


-143- 


-118-145 


Sequence 


J. fi J , '^PP 


22 


902.4 


32. 


, 8 


3884 


13 


us- 


-10- 


-144- 


-993-145 


Sequence 


145, App 


23 


902.4 


32. 


.8 


3884 


13 


us- 


-10- 


-158- 


-787-145 


Sequence 


145, App 


24 


902.4 


32, 


. 8 


3884 


13 


us- 


-10- 


-140- 


-024-145 


Sequence 


145, App 


25 


902.4 


32 . 


. 8 


3884 


13 


us- 


-10- 


-140- 


-808-145 


Sequence 


145, App 


26 


902.4 


32- 


, 8 


3884 


13 


us- 


-10- 


-152- 


-405-145 


Sequence 


145, App 


27 


902.4 


32. 


. 8 


3884 


13 


us- 


-10- 


-127- 


-852A-145 


Sequence 


145, App 


28 


902.4 


32 < 


. 8 


3884 


13 


us- 


-10- 


-127- 


-900A-145 


Sequence 


145, App 


29 


902.4 


32 . 


. 8 


3884 


13 


us- 


-10- 


-128- 


-685A-145 


Sequence 


145, App 


30 


902.4 


32 , 


. 8 


3884 


13 


us- 


-10- 


-131- 


-820A-145 


Sequence 


145, App 


31 


902.4 


32 


. 8 


3884 


13 


us- 


-10- 


-142- 


-886-145 


Sequence 


145, App 


32 


902.4 


32 


. 8 


3884 


13 


us- 


-10- 


-146- 


-728-145 


Sequence 


145, App 


33 


902.4 


32 


. 8 


3884 


13 


us- 


-10- 


-146- 


-786-145 


Sequence 


145, App 


34 


902.4 


32 


.8 


3884 


13 


us- 


-10- 


-147- 


-499-145 


Sequence 


145, App 


35 


902.4 


32 


. 8 


3884 


13 


us- 


-10- 


-157- 


-798-145 


Sequence 


14 5, App 


36 


902.4 


32 


. 8 


3884 


15 


us- 


-10" 


-028- 


-072-145 


Sequence 


145, App 


37 


902.4 


32 


. 8 


3884 


15 


us- 


-10- 


-121- 


-049-145 


Sequence 


145, App 


38 


n A O A 

90^.4 




p 

. D 


ft P /3 




US- 


-10- 


-123- 


-904-145 


Sequence 


145, App 


39 


902.4 


32 


.8 


3884 


15 


US- 


-10 


-140- 


-470-145 


Sequence 


145, App 


40 


902.4 


32 


.8 


3884 


15 


us 


-10- 


-175 


-746-145 


Sequence 


145, App 


41 


902.4 


32 


.8 


3884 


15 


US' 


-10 


-176 


-918-145 


Sequence 


14 5, App 


42 


902.4 


32 


.8 


3884 


15 


US' 


-10 


-176 


-921-145 


Sequence 


145, App 


43 


902.4 


32 


.8 


3884 


15 


us 


-10 


-137 


-865-145 


Sequence 


145, App 


44 


902. 4 


32 


.8 


3884 


15 


US 


-10 


-140 


-474-145 


Sequence 


145, App 


45 


902.4 


32 


.8 


3884 


15 


us 


-10 


-142 


-431-145 


Sequence 


145, App 



ALIGNMENTS 



RESULT 1 

US-09-918-779-1 

; Sequence 1, Application US/09918779 
; Publication No. US20030064369A1 
; GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Taupier, Raymond 
Padigaru, Muralidhara 
Luca 
Steven 
Richard 
Bryan 



Rastelli^ 
Spaderna, 
Shimkets, 
Zerhusen^ 
Spytek, Kiitiberly 
Shenoy, Suresh 
Li, Li 

Gusev, Vladimir 
Grosse, William 
Alsobrook, John 
Lepley, Denise 
Burgess , Catherine 
Gerlach, Valerie 
Ellerman, Karen 
MacDougall, John 
Stone, David 
Smithson, Glennda 
TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-074 US 
CURRENT APPLICATION NUMBER: US/09/918,77 9 
CURRENT FILING DATE: 2 001-07-30 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,697 
PRIOR FILING DATE: 2000-08-16 
PRIOR APPLICATION NUMBER: 60/263,662 
PRIOR FILING DATE: 2001-02-01 
PRIOR APPLICATION NUMBER: 60/281,645 
PRIOR FILING DATE: 2001-04-05 
NUMBER OF SEQ ID NOS : 61 
SOFTWARE: Patentin Ver. 2.1 
SEQ ID NO 1 
LENGTH: 2752 
TYPE : DNA 

ORGANISM: Homo sapiens 
US-09-918-779-1 



Query Match 100.0%; Score 2752; DB 13; Length 2752; 

Best Local Similarity 100.0%; Pred. No. 0; 



2752; Conservative 0; Mismatches 0; Indels 0; Gaps 



1 CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 6 0 



61 
121 



I I I I M I I M I I I I M M I I I I M I I I I I M M I I M I I I I I I I I I I I I I I 

CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 
GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 

I I I I I I M I I I M I I I M I I M I M I I I I I I I I I I M I I I M I I M I I I M I I I I I I I I 1 

GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 



120 



CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

I I 1 1 I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I M I I I I I I I I I 1 1 I I 

121 CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 



181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I M I I I I I I I I I I M I I I I 

181 TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 24 0 



241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 

I I I I I I I I I I M I I I I I I I I 1 I I I I I M M I I I I I I I I I N I I I I I I I I I M I I I I M M 

241 AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 



300 
300 



301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

I I I I I I I I I I I I I I I M I I I I M I I I I I I I I M I I I I I I I I I I I I I M I I I I I M 

301 GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 



420 



361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 

I I I I I I M I I I I I I I I I I I I I I I I I I 1 I M I I t I I I I I I I I I I I M I I I I I I I I I I I I I I 
361 ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 



421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINIM 

421 TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 



481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 

I I I I M 11 I II I 11 M M I I M I I II M M I I M I M M II I I I M I I I I M I II I M I I 

481 AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 



480 
540 
540 



541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

I I I I I I I I I I I I I I I M II M II I M II M M II I M I II II I I M I II II I M M I II I 

541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 



600 
660 



601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 

I I I I I I I I I M II I M II I 11 I I I I I M M II II I I II II II II M I II I I I M II II II 
601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 



661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 

I I I M I I II I 1 I 1 M I I M II II I M II 1 I I II 1 I I II I I 1 I I M I M I I M 

661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 



720 

720 



721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 

IIMMIMMIMMMM 1 II I 1 II II I I I I II 1 I 1 II I I 1 I 11 I M M 1 I I I 

721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 
781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 840 

ilMMIMMMM MMIIlllllll I Illllll 

781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 840 



Ov 


841 


AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 

1 M 1 1 M 1 1 1 1 1 1 M 1 1 1 M 1 M 1 M 1 1 1 M 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 


900 


Db 


841 


900 


Ov 


901 


GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 


960 


Db 


901 


960 


Ov 


961 


AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 

1 1 1 1 M 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 


1020 


Db 


961 


1020 




1021 


GCACCCCGCTiACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 M 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 


1080 


Db 


1021 


1080 


Ov 


1081 


AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 M 1 1 1 1 1 1 M 1 M 1 1 M 1 1 1 1 1 1 1 I 1 1 M 1 1 1 1 1 

AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 


1140 


Db 


1081 


1140 


Ov 


1141 


ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 i 1 1 1 M M 1 1 1 1 1 1 1 1 

ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 


1200 


Db 


1141 


1200 


Ov 
s^y 


1201 


AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 M 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 


1260 


Db 


1201 


1260 


Ov 

yy 


1261 


CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 

1 1 1 1 M 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 N M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 


1320 


Db 


1261 


1320 


Ov 


1321 


CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 


1380 


Db 


1321 


1380 


Ov 
s^y 


1381 


CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 N 1 1 1 M 1 i 1 1 1 1 1 1 1 1 

CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 


1440 


Db 


1381 


1440 


Ov 
vy 


1441 


CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 N 1 M 1 1 M 1 1 

CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 


1500 


Db 


1441 


1500 


Ov 
vy 


1501 


AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 M 

AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 


1560 


Db 


1501 


1560 


Qy 


1561 


r^rrr' r^r-rrrrrrr'TriikTaATrrrTAAT AC AGGTATCAGCCTCCTCATCCCCCCAGATGCC 
1 1 M 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M 

CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 


1620 


Db 


1561 


1620 


Qy 


1621 


ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 

1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 M 1 1 1 M 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M M 1 M N 1 1 1 
ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCAC/VAGCCGGAAGACGTGAGG 


1680 


Db 


1621 


1680 



Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 



TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 174 0 

IIIIIMIII I I I I I I I I M I I I I I I I M I Illlllll 

TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 



1740 
1800 



GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I M I I I M M I I I I M I I I I I I I I I M 
GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 1800 

AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I M I M I I I I I I I I M M I I I I I I I I I 

AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 
CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 192 0 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M M I I I I I I I I I I I 

CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 1920 

GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 198 0 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I 
GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 198 0 

GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

I M I I M I I I I I M I I I I I I I M I I I I I i I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I 
ATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAG 2100 

AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 

CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

I M M I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I M M I I I I I I I I I M I I I I I 

CACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTT 2220 

GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

I I I I I I I I I I I I I M I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I M I I 
TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 2340 

GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 2400 

I I I I I I I M I I I I M I I I I I I I I I I I I I M I I M I I M I I I I I I I I I I I I I I I I M I I I I 

GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 2400 
ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 24 60 

I I I I I I I I I M I I I I I I I I I I I M M I I I I I M I M I I I I I I I I I I I I M I I I I I I I M I 

ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 2460 
CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

M I I I I I I I I I I I I I I I I I I I I M M I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATT^TTTCCAGCCTGGACCCA 2520 
2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 258 0 



I I I I M I I I I I M M I I I I I I I I I I I I I M I I I 1 M N 1 I i N i I I I I N I I M I I I I I I 
Db 2521 CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 2580 

Qy 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

I I I I M I I I I I I I I I M I I I I I I I I I I I I I I M M I I I I I I I I I M I I M I I I I I I I M I 

Db 2581 CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 2640 

QV 2641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I i I I I I I I I M M M I I I I I I I I 

Db 2641 CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 

Qy 2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

I M I I I M I I M I I I I I I I I I I M I I I I I I I I I I N I I I I I I I I I I I I I I M 

Db 2701 CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 



RESULT 2 
US-10-624-932-1 

Sequence 1, Application US/10624932 
Publication No. US20040096877A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Taupier, Raymond 
Padigaru, Muralidhara 
Luca 
Steven 



Rastelli, 
Spaderna, 
Shimkets, Richard 
Zerhusen^ Bryan 
Spytek, Kiinberly 
Shenoy, Suresh 
Li, Li 

Gusev, Vladimir 
Grosse, William 
Alsobrook, John 
Lepley, Denise 
Burgess, Catherine 
Gerlach^ Valerie 
Ellerman, Karen 
MacDougall, John 
Stone, David 
xT-i. ^o^^^-".* - Smithson, Glennda 

TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-074 US 
CURRENT APPLICATION NUMBER: US/ 10/624 , 932 
CURRENT FILING DATE: 2003-07-21 
PRIOR APPLICATION NUMBER: 09/918,779 
PRIOR FILING DATE: 2001-07-03 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 



PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 61 
SOFTWARE: Patentin Ver. 2.1 
SEQ ID NO 1 
LENGTH: 2752 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-624-932-1 

Query Match 100.0%; Score 2752; DB 17; Length 2752; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2752; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 

I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I 

GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 120 
CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

I I I I I M I I I I I M M M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 240 

AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 

I I I I I I M I I I I I I I I I I I I I I I I I M M I I I M M I I I I I I I I I I I I M I I I I I I I M I 

AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 
GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I M I M 

GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 
ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

I M M I I M I I M I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 
TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

I I I I I I I I I M I I I I I I I I I 1 I I I I I I I I I I M I M I I I I I M I I I I M I I I M I I I I M 

TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 



I I I I I M I I I I I I I I I I I I M I I I I M I I I I I I I I I I I M I I I I I M I I I I I I I I M I I I 

AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 



540 



Qy 



541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 



541 ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 
601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

I I M I I I I I I I I I I I M I M I I I I I M I I I M I I I I I I I I I I I I I N I I I I I I I I I I I I I 

601 AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 
661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 

M I I I I I I I I I I I M I I I I I I M I I I I I I M I I I I M I I I I M I I I I I I I I I I I I I I I I I 

661 CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 720 
721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 

M IIIIIIM IIMIIIIIIIIIIIIMIIMM I 

721 ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 78 0 

781 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGA7\ACGG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 81 TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 84 0 

841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 90 0 

I I I I I I I I I I M I I I I I I M I I I I I I I I M M I I I M I I I I I I I I I I I I I I I I I M I I I I 

841 AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 900 

901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 

I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
901 GTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGC 960 

961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
961 AAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCA 102 0 

1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 1080 

I I I M M I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 GCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACC 1080 

1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 114 0 

I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I 
1081 AGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTC 1140 

1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I M I I I I I I I I I I I I I I I I I 

1141 ATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG 1200 

1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 12 60 

I I I I I M I I I I I I I 1 I I I I M I I I I I I I I I I M I I I I I I I I I I I 

1201 AAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAG 1260 

1261 CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 

M I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

1261 CCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCGGAC 1320 
132 1 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 138 0 

I I I I I I I I I I I M M I I I I M I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1321 CTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCCAGC 1380 
1381 CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 1440 

I I I I I I I M I I I I I I I I I I I I I M I I I I M I I I I I I M I I I I I I I I I I I I I I M I I I I I I 



Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 



CCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACA 14 4 0 
CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

I I 1 I I I I I I I I I I I I I I I I I I I I M I M M I I I I I I I I M I I M I I M I I I I I I I I I M I 

CTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAG 1500 

AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 

I M I I I I I I M I M I I I I I I I I I I M I I I I I I I I I I I I I I M I M I I I M I I I I I I I I I I 

AACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTC 1560 
CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 162 0 

I I M M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I M I I I I I I M I I I I I I 
CTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGATGCC 1620 

ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 1680 

I I I I I I M I I I M I I I I I I I I I I M I I I I I I I M I I I I I I M I M I I I I I I I I I M I I I I 

ATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTGAGG 168 0 

TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

I I I I I I I I I I I I M I I I I I I I I M I I I I i I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I M I 
TTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGC 1740 

GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 1800 

I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I M I I I M I M I I M I I I I I I M I I I I I 

GTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGAC 18 00 
AGCTGGAGCCTGCGCCTCAAT^GCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 

I I I I I M I I I I I I I I I M I I I I I I I I I I I I M I 1 M I I I I I I I I I I I M I I I I I I I I I I I 

AGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATGTGCTGCAC 1860 
CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I M I I I 

CTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTAC 192 0 
GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 

I I I n I I I M I I M I I I I I I I I I I I I I I 1 I I I i I I I I I I I I I I I I I I I M M M M I I I I 

GTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCC 1980 
GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 2040 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I M 

GCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAAC 204 0 



I I I I I I M I I I I I I I I I I I I I I I M M I I I I I I 1 I I I I I M M I I I I I I I I I I I I I I I 



AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 
I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I M I I I I I I I I M I I I 
AAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTAC 2160 



I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 



GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 2280 

I I M M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I 
GTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCAC 228 0 



Ci\7 


^ ^ O J. 


Db 


2281 


vy 




Db 


2341 




9401 




2401 


Qy 


94 f^l 


UxJ 


2461 


Qy 


9 R9 1 




9 S9 1 


Qy 


0 1 

1 OD X 




^ «J U J. 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 



TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 

I I I I I I M I I I M I I M I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I M 

TGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGG 



2340 



2340 



2400 



GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 1 I I I I 
GTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACCAAGGAC 2400 

ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 2460 

M I I I I I I I I M I I I I I I I I I I I I I I I M I I M I I I I I I I M I I I I I M I I I I I I I I I I I 

ACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGC 24 60 

CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I 
CCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 2520 

CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 258 0 

I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I M I I I I I I I I I I 

CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCAT 2 5 8 0 



CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 
I I I I I I I I I M I M I I I I I M I I I n I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCG 



2640 



2640 



CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 27 00 

I I I I I I M I I I M I I I I I I I I I I I I M I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I 

CGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAG 2700 
CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

I I I I M I M 1 I I I I I I I I I I I I I I I M I M M I M I I I I I I I I M I I I I I I I 

CCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 



RESULT 3 
US-09-970~944-l 

; Sequence 1, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 

; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patentin Ver. 2.1 

; SEQ ID NO 1 

LENGTH: 2881 
; TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-970-944-1 



Query Match 97.3%; Score 2676.4; DB 11; Length 2881; 

Best Local Similarity 98.9%; Pred. No. 0; 

Matches 2728; Conservative 0; Mismatches 21; Indels 9; Gaps 3; 

CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 60 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I 
CCGCGGGGCCCCGCGCCCGGCCCGCCCGCCTGCCCGCCCGCGGCCATGGCCGTCCGGCCC 101 

GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 12 0 

I I I I I M I I M I I M I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I 

GGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGCC 161 
CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 180 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I M I I I I M I M I I I I I I I I I I I 

CAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCAC 221 
TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

TTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGC 2 81 

AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 300 
I I I I I I I I I I I I I I I I I I I I I I M I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTG 341 

GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGC 360 

I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I III I I I I II I I I II II II 

GACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGTGAGCCGACCATGGAGGTCCGC 401 

ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 420 

I I I I M M I I I I I I I I II M I I II I I I I I I I I I M I I I II M I II II II I II II I II II I 
ATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAG 461 

TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 480 

M I I I I I II II I I I II I M I I II I I I I I I I I II II I I II II I M I I I II I I M I I I I II I 

TGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCC 521 

AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 540 

1 II I I I I I I I I I II I I I I I I I I I II II I I I I I I I I I I M I I I I I I I I I I I I II I I I M I I 
AGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGC 581 

ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 600 

I II I I I I I I M I I I I I M I I I I II M I II II II I I I I I I I I M I I M I I I I I I I I II I I I 

ATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGG 641 

AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 660 

I II II II I I I I I M I II I I I I I I I I I I I I I I I I M I I I I II I I I I I I II I I I I I I I I I I I 
AACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACAGC 701 

CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I M II I M I I I I I I I I M I I I I I I I I I I I I M I I I I I 

CTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAAC 761 

ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 780 

I II II M I I I I I I II I II I I I I I I I I I I I I I I I I I M 1 I I I I I I I I I I I I I I I I I I I I I I 
ATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGG 821 



wy 


1 


Db 


42 


yy 






102 




191 




162 


Qy 


1 p 1 

X O X 


L/JJ 


^ ^ ^ 


yy 


941 

^ X 


jjjj 


9 fi9 


Qy 


J U X 


Db 


342 


Qy 


o ox 




402 


yy 


4 91 


Db 


462 


r\\7 

Qy 


4 ftl 

1 O X 


nh 


522 


Qy 


R4 1 

•J rl X 




582 


Qy 


601 


Db 


642 


Qy 


661 


Db 


702 


Qy 


721 


Db 


762 



Qy 


781 


Db 


822 


Qy 


841 


Db 


882 


Qy 


901 


Db 


942 


Qy 


958 


Db 


1002 


Qy 


1018 


Db 


1062 


Qy 


1078 


Db 


1122 


Qy 


1138 


Db 


1182 


Qy 


1198 


Db 


1242 


Qy 


1258 


Db 


1302 


Qy 


1318 


Db 


1362 


Qy 


1378 


Db 


1419 


Qy 


1438 


Db 


1479 


Qy 


1498 


Db 


1539 


Qy 


1558 


Db 


1599 


Qy 


1618 



TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 

I I M I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGG 

AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAAT 



840 



881 



900 



941 



957 



GTCCAGAA AACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGG 

I I I I I I I I I I I I I II I I II I I I I I I I I II I M I I I I I II I 

GTCCATGACCGCACCGTCTCCTCTCTGCTTGTCTCTGTGGACGGCAGCTGGAGCCCGTGG 1001 

AGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGAC 1017 

I I I II M I I I M I I I I II I I I I I I I I II I I I II I I I I I I I I M I I M I I I II I I I I M II 

AGC7UVGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGAC 1061 
CCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGT 1077 

I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I M 

CCAGCACCCCGCTVACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGACACCCGCAACTGT 112 1 

ACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGC 1137 

I I I I II II I I I I I I I I I II I I I I I I I I M I M I I I I I I II I I I I I I I I I I I I I I I I I I I I 
ACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGC 1181 

CTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGC 1197 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
CTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGC 1241 

CGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTC 1257 

I II II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I M I I I I I M I I I 
CGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTC 1301 

CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCG 1317 

I I I M I I M M I I I I I I I M I I I I I M M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

CAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACCATCCAGCCG 1361 

GACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCC 1377 
I II II I II I I I I I I I II II I M I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
GACCTCAG CACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGATGGGCCC 1418 

AGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCAC 1437 

I I I M I I I I I I I I I I M I I I I I I I I II 11 I I I I I I I M I I I I I I I I I I I I I M I I I I I I I 
AGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCAC 1478 

ACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACC 1497 

I I M I I I I I I I I I I M I I I I II I I I M I I I I I I I I I I I I I I M I M M I I I M I I I I I I I 

ACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACC 1538 

CAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAAC 1557 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I 
CAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTCAAC 1598 

TTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGAT 1617 

II I I II I M I I I I I I I I I I M I I I I I I II I I I I I I I I M M I I II I II II I I I I I II I I I 

TTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCAGAT 1658 



1618 GCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1677 



Db 


1659 


Qy 


±0/0 


Db 


1719 


uy 


X / 0 0 




1779 


Qy 


1 7 Q ft 






Qy 


1 ft 


jjxj 


J. 0 .7 ^ 


Qy 






X ^ >j 7 


Qy 


ly / 0 


JJJL) 


^ \J X ^ 


Qy 


9 n 


JJD 




Qy 


^ u y 3 


UL) 


X 0 -7 


Qy 


9 1 S R 




2199 


Qy 


Z Z X o 




2259 


Qy 


9 9 7"^ 




2319 


Qy 


z 00 o 


Db 


2379 


Qy 


2395 


Db 


2439 


Qy 


2455 



I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I M I I M I 

GCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGACGTG 1718 
AGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCT 1737 

I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I M I I I I I I I I I M I I M I I I 

AGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCT 1778 
GGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCT 1797 

M I I I I I I M I I I I I I M I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I M I 

GGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCT 1838 

GACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGG AGGATGTG 1854 

I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGCAGGATGTG 1898 

CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCC 1914 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTGCC 1958 

TGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTG 1974 

I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

TGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTG 2018 

GCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAG 2034 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I M I I I I 

GCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAG 2078 



TACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAG 

I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
TACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGCAG 



2094 



2138 



2154 



CTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGAC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I M I 
CTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGGAC 2198 

AGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAG 2214 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I 

AGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTAAG 2258 
CTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTAC 2274 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 1 I M I N I I I I I I I I I I I I 

CTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTAC 2318 

TTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAG 2334 

I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I M I I I 

TTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAG 237 8 

CTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACC 2394 

I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 

CTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCACC 2438 

AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTG 2 454 

I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
AAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGCCCTG 2498 

GTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTG 2514 

I I I I M I M I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 



Db 2499 



GTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTG 



2558 



Qy 2515 GACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGAC 2574 

M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I M M M M I I I M I I I I I I I I I 
Db 2559 GACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGAC 2618 

Qy 2575 AGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGG 2634 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 619 AGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGG 2 678 

Qy 2635 GAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTG 2694 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I M M M M I I I I I I I I I I I I 

Db 2679 GAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGGACTG 2738 

Qy 2695 GGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 2752 

I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I N I I M I I I I 

Db 2739 GGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCAG 27 96 
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Query Match 85.6%; Score 2356; DB 17; Length 3580; 

Best Local Similarity 93.6%; Pred. No. 0; 

Matches 2537; Conservative 0; Mismatches 5; Indels 168; Gaps 1; 

Qy 43 GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 102 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I 

Db 1 GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGG 60 

Qy 103 CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 162 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CTCCGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAAC 120 

Qy 163 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 222 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 121 CCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAG 18 0 

Qy 223 CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCT^CGGG 2 82 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGG 240 

Qy 283 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 342 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTG 300 

Qy 343 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAG7UVGGTGTTCGGGCTG 402 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTG 360 

Qy 403 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 GAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAG 420 

Qy 463 GCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 522 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCCTACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAG 480 

Qy 523 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 582 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCC 540 

Qy 583 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 642 

I I I I I I I I I I M I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I M I I I I I 
Db 541 GAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATAC 600 

Qy 643 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCcAaCTAC 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I f I I I I I 

Db 601 ATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTAC 660 

Qy 7 03 ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTC 7 62 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 ACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTC 720 

Qy 763 TACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGG 822 

I I I I I I 



726 

721 TACGTG 

823 CGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCT 882 

726 

727 

883 TTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACG^ 942 

GACGGC 732 

727 

943 AGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 1002 

I I 1 I M 1 M I I I I I I I I I 1 I M I I I 1 M I 1 I I I I I I I I I I M I I 1 I M I ' 

733 AGCTGGA( 
1003 CGTGAi 



.GCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGC 792 
.GTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 1062 



I I I I I I I I I I I 



IIIIIIIIIIIIIIMIIIIIIIIIIMIIIMIIIIIIIIII 



793 CGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTG 8 52 

1063 GACACCCGCAACTGTACCAGTGACCTCTGTGTAc4^AGTG|cTTCTGGCCCTGAGGACGTG 1122 

853 iACACCCGcJJ.CTGTACCAGTGACCTCTGTGTACAWGCTTCTGGCCCTGAGGAC 912 

GCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTC 1182 



1123 

I 1 I I I I I I I I 1 1 M M I I I I I 1 I I I I I I M I I I I I I I I I I I I I I I I I I 1 I 1 I I I I I I M . 

913 iiicTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTC 972 



1183 ATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATT 1242 
11 I 11 I 1 I 1 I I I I 1 I I 1 I I 1 I I I I I I I I liililliliiiliiiliiiUli ' 

973 
1243 



ATCCTCGTTTATTGCCGGAAGAAGGAGGGGCT^^^ 1032 
CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTG 1302 



M I I II I I M I II I I I 1 1 I 1 II I I I 1 II 1 I 1 I 1 I 1 M 1 11 I 1 1 1 I M I I 1 1 I I 1 I I I I 
1033 CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTG 

1303 CTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTC^ 

IIIIIIIIMIIIIMIIIIIIIIIIIIII IIIMIIMilil IIIMI 

1093 CTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCC 
1363 CGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTG 

lll|i|IIIMiMilllllllMMIIIIMIilllll I I I I I I I 1 I I 1 I M I I 

1153 CGGCAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTG 
1423 GGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTC 

IMIIIlllllllllllllllllllMIIMIIIIIIIMIIIIIIIIIIMIMllMI 
1213 GGTGGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTC 

1483 TclcCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACC 
IIIMIIIIMIIMIIIMIIIMIMIIIIIIIIMIIIIIIIMMIIIIIIIIIII 

1273 TCfCCGCCTCTCCACCCAGAAl 
1543 



1092 
1362 
1152 
1422 
1212 
1482 
1272 
1542 



.CTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACC 1332 

TATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTC 1602 

IIIMIIIIMIIIIIIIIIIMIIIMMIMIIIIIMIMMIIIII IIIMMII 

1333 TATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAAT^^^ 1392 

1603 CTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCAC 1662 

IMMMIMMMIIIMIIMIMMIIMMIIIIIIIIIMIIMIIIIIIIIIM 

1393 CTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGC^ 1452 



Qy 


1663 


Db 


1453 


Qy 


1723 


Db 


1513 


Qy 


1783 


Db 


1573 


Qy 


1843 


Db 


1633 


Qy 


1903 


Db 


1693 


Qy 


1963 


Db 


1753 


Qy 


2023 


Db 


1813 


Qy 


2083 


Db 


1873 


Qy 


2143 


Db 


1933 


Qy 


2203 


Db 


1993 


Qy 


2263 


Db 


2053 


Qy 


2323 


Db 


2113 


Qy 


2383 


Db 


2173 


Qy 


2443 


Db 


2233 



AAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTT 1722 

I I I I I I I I I I I I I I I I I I I I I M I I I I I i I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
AAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTT 1512 

AGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGT 1782 

II II II II II I M II II II II II II I II I II II II III II I II I I I II II I I II M I I II 
AGCTGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGT 1572 

GGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGC 1842 



I I II I I I I II II 



I I I I I I I I I M I I I I I I I I I I I I II I I I I I I M I I I I II I M I I I I I 



GGGGAGCCCAGC(^^CTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGC 1632 
TGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTG 1902 

I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I M II I M M I I I I I I I I I I I I I M 
TGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTG 1692 

GAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAG 1962 

I I I M I I I II M II I II I I I I I I I I I I I I I I II I I I M I I I II I I I I I I I I I I I I I M I I 
GAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAG 1752 

GCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGC 2022 

I I M I I I I I M II I I I I I I I I M I I M I I I I I I I I I 1 1 I I I I I I I I I I M I I I I I I I I I I 

GCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGC 1812 

ACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAG 2082 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M M I I I I I I II I I I I I I I I M 
ACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAG 1872 

GAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTG 2142 

I I M I I I I I M II II I I II I I I I I I I I M I I I I i I I I II M I M M II I I I I I I I I I I I I 

GAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTG 1932 
CACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTG 2202 

I I I M I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I M I I 

CACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTG 1992 

TGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGC 22 62 

M I I I I I I II M I I I I I I I I I I I I I I I I II I I I M I I I I I I I I II M II I I I I I I I I I I I 
TGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGC 2052 

ACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGAC 2322 

I II I II II II II I M I I I I II I I II I I I I I I I I I I I I I I I I I I M I I II M I I M I II I I 

ACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGAC 2112 

CTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAAC 2382 

I I II I I I I I I I I M I I I I I I I I I I I M I I I I I I M I I I I I I I M II I I I I I I I I I I I I I I 
CTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAAC 2172 

TTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 2442 

I I I II II I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I II I I I I I I I I II I I I 

TTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGG 2232 

GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 2502 

I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I M M 1 I M I I I I I I I I I I I I I I I I I 
GTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATA 2292 



Qy 2503 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2562 

I I I I I M I I I I I I I M I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 2293 ATTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAA 2352 

Qy 2563 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2622 

I I M I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 2353 CTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATC 2412 

Qy 2 623 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2682 

M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M M I I I I I M I I I I I I I I I I I I I I M 

Db 2 413 CTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCA 2472 

Qy 2683 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2473 GTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2532 

Qy 2743 GGCCGGCCAG 2752 

I I I I I I I I I I 
Db 2533 GGCCGGCCAG 2542 
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Qy 4 6 ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 

Db 1 ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 

Qy 106 CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

II II I I I I I I I II I II I I I I I I I I I I I II II I I I I I I I I I II I I I I I I I I I I I I 

Db 61 CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 120 

Qy 166 GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

II II II II I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 180 

Qy 226 GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 285 

III II I I I I I I II I I 11 Mill I I II I I M I I I M I II M I I I II II II I II 

Db 181 GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 240 

Qy 28 6 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

Mill II II II M II I II II M II I II II II I Ml I II II I II MM 

Db 241 TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 300 

Qy 346 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 405 

I I I M I I II II M I II II II II II I I I M I I I I I I II I I II II I I I I I I I II 
Db 301 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 360 

Qy 406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 4 65 

I M II I II II I I II M I II II I I II I II II I II II I I II M II II II I II II I I II I 

Db 361 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 420 

Qy 466 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

I II I M M II III II I I I M I M I M I I II I M II I M I I I I I I I I I II III 
Db 421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 4 80 

Qy 526 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

M II M II II Mill II II II II II II II II M II II I Mill III 

Db 481 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 540 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

I M M I II II I II II II I I I I II II I I I I II I II II II I I I I II II II I II I 
Db 541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 600 



Qy 



64 6 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 705 



Db 601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 660 

Qy 706 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

II I M I I I I I I I I I I I I I I II II II I I II III I I I I II II I I I II II II 

Db 661 TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 72 0 

Qy 766 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

I I I I I I I I I I I I II I I I II I I I I II I M I II I II M I I II II II I I II I II II II I I I 
Db 721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 780 

Qy 826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 8 85 

I I I I I I M I I I M II I I I I I I I I I M I I I I I I I II I I I I I I M I I I II I I I I I II III 

Db 7 81 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 84 0 

Qy 886 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

I I I I I I I I I II I I I II I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I II II Ml 
Db 841 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 900 

Qy 946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

Mill II I II II II II II II M II I I I I I M I I I M I II I I II II II I I I I I II 

Db 901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 960 

Qy 1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

I II II I II II II II II II M II M II MM! II M II II I III II II M II I II 

Db 961 GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 1020 

Qy 1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

I I M I I II II II I II II I I II I II II I I I II I M II I I I II II II I I I M II 

Db 1021 ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 1080 

Qy 1126 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

Mill I II I II I II I M I II II II II I II II II I II II I I II 
Db 1081 CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 

Qy 1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

Ml I I II II II I I II I I I I II I I M II II I I I I I I I M M I M II I I I I III 

Db 1141 CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 1200 

Qy 1246 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACi^ACCCCCATCTGCTC 1305 

II II I II M II II II I M II II II M II II II II I II M II II II II II II II I II I 

Db 1201 ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 1260 

Qy 1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I II II II I M I I I II II I M I I I II M I I I I II I I I I I II I I I I I I I I III I II 

Db 1261 ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 1320 

Qy 1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

II II M II II II M I II M M II II I II I I II II I II II M II II II II II I I I 

Db 1321 CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 138 0 

Qy 1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I I II II M I II M I II II II II II II II I II I I I II M II II I II II I II II 
Db 1381 AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 14 4 0 

Qy 1486 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCJ^ACATGACCTAT 1545 

II II M II II II II M II 11 II II II II I II II I I I II II II M M II II I II I 



Db 14 41 CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 1500 

Qy 154 6 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II II I I I I I M II I 

Db 1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 1560 

Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

II I I I I I I I I I M I I I M I I I M I I I I I I I I I I I I I I I I I I I M I I I II II I I I 

Db 1561 ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 1620 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

II II I I M I I I I I I I I I I I I I I I I I II I I M I M I I I I I I I I I I I M I I I I I I I I I I 
Db 1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I I I I I I I I I I II II I M I I I I I I I I M I M I I I I I I I I II II I I I I I I I I I II I 

Db 1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 1740 

Qy 1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

I I I M I I I I I I II I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I IN 
Db 1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 

Qy 184 6 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I I I I I I I I I I I II I I II I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1801 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 18 60 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

IN I I I I I I M I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I II II I II I I I I I II 
Db 1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 198 0 

Qy 2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

Mill I I M I I I I I I I I I I II I M I I I I I II I I I I I I I I M I II I I I I I I I I I 
Db 1981 TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 

Qy 2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

I I I I I I I I I I I I I I I I I I I II I I II I I I I I M I I I I I I I I I I I I I II I I I I I I I I I 

Db 2041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

Qy 2146 TTCT^GGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

I I I I I I I I II I I I M I I I M II II II I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 2101 TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 2160 

Qy 2206 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 2161 AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 2220 

Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

I I I I III I M I I M I I I I I I I I I I I M I II I I III I I I I I I I I II II I I I I 
Db 2221 CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 2280 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I 
Db 22 81 GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 234 0 



Qy 2386 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

I I I I I I I I M I I I I I I I I I I I I I M I I I II I I I I I I I M I I II I I I I I I I I I I I I 

Db 2341 AACATCACTAAGGACACAAGGTTTGCTGTSiATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 2400 

Qy 2446 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 

II I II II II I I I I M I II I I I I I M M I M II M I I II II I M II I II M I I I I I M 

Db 2401 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 2460 

Qy 2506 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 2565 

MM M M II I I M M M M I I I II I II I I II II I Mill II II II II I I I 

Db 2461 GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 2520 

Qy 2566 CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2625 

I II I II II II II I II I I II II II II I I I II II I M I I II I I II II I M II II II I II I 

Db 2521 CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 2580 

Qy 2626 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2 685 

II II I II M II I I II II I II I I I I II I I M II I M I I II II II M M II I II III 

Db 2581 AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 264 0 

Qy 268 6 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

II II I I I I I I II I Mill I II I II II I II II I I II M II II II II II I II I I I 
Db 2641 GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGAGAC 2700 

Qy 2746 CGGCCAG 2752 

I II II I 

Db 2701 CAGCCAG 2707 



RESULT 6 

US-10-256-702-1 

; Sequence 1, Application US/10256702 
; Publication No. US20030059859A1 
; GENERAL INFORMATION: 

; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 

NUMBER OF SEQUENCES: 8 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 

COUNTRY: USA 

ZIP: 94104 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 10/256, 702 

FILING DATE: 27-Sep-2002 



CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
FILING DATE: 20-Aug-2001 
APPLICATION NUMBER: 08/808,982 
FILING DATE: <Unknown> 
ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 3014 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-10-256-702-1 

Query Match 82.1%; Score 2259; DB 15; Length 3014; 

Best Local Similarity 89.7%; Pred. No. 0; 

Matches 2 427; Conservative 0; Mismatches 280; Indels 0; Gaps 0; 
Qy 46 ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 

Qy 106 CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

II II I I I I I I I I I I I II I I II II I II I I I I I I I I I I I I I II II I I II I I i I M I 

Db 61 CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 120 

Qy 166 GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

I I I I I I M I I I I I I I I I I M I I I I I I I Mill I I I I I I II M I I II M I I I I II 

Db 121 GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 180 

Qy 226 GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCTiAGTGCAACGGGGAG 285 

III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 181 GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 24 0 

Qy 286 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

I I I II I M M I I I II I I I I I II II M I M I I I III I I I I I I II I I II 
Db 241 TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 300 

Qy 346 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 405 

I I I I I I I M II I I I II II II II I I I I M I I M I Mill I I II I I I I I I I I I I 
Db 3 01 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 360 

Qy 4 06 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 4 65 

I I I I II I II I I I I II I I I I II I I I I I II I I I I I I I II I I II II II I I I I I II I I I I I 
Db 3 61 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 42 0 



Qy 



4 66 TACATCCGCATAGCCAGATTGCGCAAG7W!^CTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

I M II I I I M III II I I M I II I I I I I M I I M I I I M I M M I I I I II III 



Db 



421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 4 80 



Qy 52 6 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 585 

II I M I I I I I I II II II II II II II II II II II I I II I I I I M III 
Db 481 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 540 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

II II I I I I I I I II II II I I I I M I I II I I I I I I I M II I M I I I I I I I I I I I 

Db 541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 600 

Qy 646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 705 

I I I M I M M I I I I I II II Mill M I I I II I I M II I I I I II I I I II I I I I II I 

Db 601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 660 

Qy 706 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

II I I I I I I I I I II I I I I I I II I I I I I I I I III II I I II II II I I I II II 

Db 661 TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 72 0 

Qy 766 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

I I I I I I I I I I M I I I I I I I II I I I I I I II I I I M I I I I I I I I I I I I II I I I I I I I II I 
Db 721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 780 

Qy 826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCGCCTCTCAACGGGGGCGCTTTC 885 

I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I III 
Db 781 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 84 0 

Qy 886 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II II III 
Db 841 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 900 

Qy 946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

I I I I I I II I I I I II I I I I I I I I I II I I I I II I I II I I I I I I I I I I I I I I I I I I I 
Db 901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 960 

Qy 1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

I II I I I M I I M I I I M I I I I I I I I I I I I I I I M I I I I I I III I I I I I I I I I I I 
Db 961 GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 1020 

Qy 1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

I I I I I I I I II I I I I I I II I I I I I I I I I MM II I I I I MM II II I II II I I 

Db 1021 ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 1080 

Qy 1126 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

Mill I II I II II I I I II II II I I M I I I M I II M II I M I 

Db 1081 CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 

Qy 1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

III I II I M II M II I I I I II M I II II I I I I I I I II I II I II II I I II III 

Db 1141 CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 12 00 

Qy 1246 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCATCTGCTC 1305 

II II I II II II I M II I II II I II II I II I M I M I II II II M II II M I II I I II 
Db 1201 ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 1260 

Qy 1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I II M II I I I I II II I II I I M M I I I I I II I I I II I M I I M II II I III I II 

Db 1261 ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 1320 



Qy 1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 1321 CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 1380 

Qy 1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I I I I I I I I I II I I II I I II I II II II I I I I I I I I I I M I II I I I I II I I I I I 
Db 1381 AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 1440 

Qy 1486 CGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I I I I I I M I I I M I I M I I M II I I I M I M I I I I II I I I M I II I I II I I II 

Db 1441 CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 1500 

Qy 1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I I I I M II I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I M II I I I I M I I I I I I 

Db 1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 1560 

Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

II II I II I I I I II I I I I I I II I I M I II I II II M I I I I I I I I I I I I I I I II I I 

Db 1561 ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 1620 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

II I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

Db 1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I I I I I I II I I II I I I I I II I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 1740 

Qy 1786 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 

Qy 1846 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

I I I I I I II I M I I I I I I M I I I I I I I II I I I I I M I I I I II I I I I I I I I I I I I I I 

Db 18 01 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 18 60 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

III I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

I I II I I I M I I I I I I I I I II I I I I I I I III M I II I I I I I I II I I I I I I I I II 

Db 1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 1980 

Qy 2026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

Mill I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I 
Db 1981 TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 

Qy 2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

I I M I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I II I I I I I I I I I 

Db 2041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

Qy 2146 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 2205 

I I II I I I I I I I I II I I I I I I I I II II I I I I I I I I I I I I M I I I M I I I I I I I I I I 
Db 2101 TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 2160 



Qy 22 06 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill I I II I II I II I I I I I I 

Db 2161 AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 2220 

Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

II I I III I I I II I I I I I I I I II I I II I I I I I I III I II M I I I II I I I I II 

Db 2221 CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 2280 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

II I I I II II I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 2281 GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 2340 

Qy 2386 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTG7VAGCGGGGGTC 2445 

II I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I II I I I I I I I I I I 
Db 2341 AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 2400 

Qy 2446 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 

II I I I I I I I I I M M I I I I II II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 
Db 2401 CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 2460 

Qy 2506 TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 2565 

I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I M I I I I i I I I I 
Db 2461 GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGAAACTT 2520 

Qy 2566 CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2625 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I 
Db 2521 CACCTGGACAGCCATCTTAGCTTCTTTGCCTCCAAGCCCAGCCCTACAGCCATGATCCTC 2580 

Qy 2626 AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCT^CCTCAGCCAGCTGGCTGCAGCAGTG 2685 

I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I II M I I I M I I I I M I I III 

Db 2581 AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 2640 

Qy 2 68 6 GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGC 2745 

II I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I 

Db 2641 GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGAGAC 2700 

Qy 2746 CGGCCAG 2752 

I I I I I I 

Db 2701 CAGCCAG 2707 



RESULT 7 

US-10-240-154-15 

; Sequence 15, Application US/10240154 

; Publication No. US20030175741A1 

; GENERAL INFORMATION: 

; APPLICANT: Cochran et al. 

; TITLE OF INVENTION: SCHIZOPHRENIA RELATED GENES 

; FILE REFERENCE: CKFW-POl-006 

; CURRENT APPLICATION NUMBER: US/10/240,154 

; CURRENT FILING DATE: 2001-04-02 

; PRIOR APPLICATION NUMBER: PCT/GBOl/01486 

; PRIOR FILING DATE: 2001-04-02 

; NUMBER OF SEQ ID NOS : 34 

; SOFTWARE: Patentin version 3,2 

; SEQ ID NO 15 

LENGTH: 2697 



TYPE: DNA 

ORGANISM: Rattus sp . 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (1) . . (2697) 
US-10-240-154-15 

Query Match 81.8%; Score 2252.2; DB 15; Length 2697; 

Best Local Similarity 89.7%; Pred. No. 0; 

Matches 2419; Conservative 0; Mismatches 278; Indels 0; Gaps 0; 
Qy 4 6 ATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTC 105 

I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I M I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGCCGTCCGGCCCGGCCTGTGGCCAGTGCTCCTGGGCATAGTCCTCGCCGCCTGGCTT 60 

Qy 106 CGCGGCTCGGGTGCCCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCG 165 

II II I I I I I I I I I I M I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 CGTGGTTCGGGTGCCCAGCAGAGTGCCACGGTGGCCAATCCAGTGCCCGGTGCCAACCCC 12 0 

Qy 166 GACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATGTGTACATCGTCAAGAACAAGCCA 225 

I II I II M I II I I M I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GACCTGCTGCCCCACTTCCTGGTAGAGCCTGAGGACGTGTACATTGTCAAGAACAAGCCG 180 

Qy 226 GTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAGTGCAACGGGGAG 285 

III II I I M I I I I I I I I I I I I I I I II I I I I I I I I I II I I I II I I I I I I I I I I 

Db 181 GTGTTGTTGGTGTGCAAGGCTGTGCCTGCCACCCAGATCTTCTTCAAGTGCAATGGGGAA 240 

Qy 28 6 TGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCC 345 

Mill II I I I I I I M II I I I II II I I I I I II I III I I I II I II I I I I 
Db 241 TGGGTCCGCCAGGTCGATCACGTAATTGAACGCAGCACCGACAGCAGCAGCGGATTGCCA 300 

Qy 346 ACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAG 405 

I I I I I I I M I I I I I II II II II I I I I I I I I I I I I I I I I Mill I I II M II I 

Db 301 ACCATGGAGGTCCGTATCAACGTATCGAGGCAGCAGGTAGAGAAAGTGTTTGGGCTGGAG 360 

Qy 406 GAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCC 4 65 

II I M I II I II II II II I I II II I II I I I I I I I I I I I I I I I II II I I II I II I I II I 

Db 361 GAATACTGGTGCCAGTGTGTGGCATGGAGCTCCTCGGGTACCACCAAAAGTCAGAAGGCC 420 

Qy 4 66 TACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGCAGGAGCCGCTGGCCAAGGAGGTG 525 

II I I II I I II III I I I M I I I I I I II I I I II I I M M I I I I I I II II I I III 
Db 421 TACATCCGGATTGCCTATTTGCGCAAGAACTTTGAGCAGGAGCCACTGGCCAAGGAAGTG 480 

Qy 526 TCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGCCGAG 5 85 

II I I M II M I I II I II II II II II II II II II Mill II I I I III 
Db 4 81 TCACTGGAGCAAGGCATTGTACTACCTTGTCGCCCCCCAGAAGGAATCCCCCCAGCTGAG 54 0 

Qy 586 GTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATC 645 

M II I I I M II I I I I I II M I I I I I M I I I I Mill II I I I M II I II I I I I 

Db 541 GTGGAGTGGCTTCGAAATGAGGACCTCGTGGACCCCTCCCTCGATCCCAATGTGTACATC 600 

Qy 646 ACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACC 7 05 

II II I I II I M I I II I I II I I I I I I I I I II I I II I II I I I II I M II II I II I II 
Db 601 ACGCGGGAGCACAGCCTAGTCGTGCGTCAGGCCCGCCTGGCCGACACGGCCAACTACACC 660 



Qy 



7 06 TGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCGCCTCCGCTGCTGTCATCGTCTAC 765 

II I M I II II I I M I M M II I I I I I I II III I I I I II M Mill II II 



Db 



661 TGTGTGGCCAAGAACATCGTAGCCCGTCGCCGAAGCACCTCTGCAGCGGTCATTGTTTAT 720 



Qy 7 66 GTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGC 825 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 GTGAACGGTGGGTGGTCGACGTGGACTGAGTGGTCCGTCTGCAGCGCCAGCTGTGGGCGT 780 

Qy 826 GGCTGGCAGAAACGGAGCCGGAGCTGCACCJ\ACCCGGCGCCTCTCAACGGGGGCGCTTTC 885 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I III 

Db 781 GGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGGCACCTCTCAACGGGGGCGCCTTC 84 0 

Qy 886 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACCCTGTGCCCAGTAGACGGCAGC 945 

M I I I I II I I M I II I II I I I I II II I I I I II M I I M M I I M I I I I I II II III 
Db 841 TGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCACTCTGTGCCCAGTGGATGGGAGC 900 

Qy 946 TGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGGACTGCACCCACTGGCGGAGCCGT 1005 

Mill II I I I II I M I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I 

Db 901 TGGAGTTCGTGGAGTAAGTGGTCAGCCTGTGGGCTTGACTGCACCCACTGGCGGAGCCGC 960 

Qy 1006 GAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTGACCTGGAC 1065 

I I I I I I I M I II M I I I II I I M I I I I I I M I I I I II I I I III I I I I I I I I I I I 
Db 961 GAGTGCTCTGACCCAGCACCCCGCAATGGAGGTGAGGAGTGTCGGGGTGCTGACCTGGAC 1020 

Qy 1066 ACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCC 1125 

I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 ACCCGCAACTGTACCAGTGACCTCTGCCTGCACACCGCTTCTTGCCCCGAGGACGTGGCT 108 0 

Qy 1126 CTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATC 1185 

Mill I II II I II M II II I M II I I I M II II I M I I I II I 

Db 1081 CTCTACATCGGCCTTGTCGCTGTGGCTGTGTGCCTCTTCTTGCTGTTGCTGGCCCTTGGA 1140 

Qy 1186 CTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTC 1245 

III II II II II II I M I II I II I I I II I I I I I II I II I M II M M I I I III 

Db 1141 CTCATTTACTGTCGCAAGAAGGAAGGGCTGGACTCCGATGTGGCCGACTCGTCCATCCTC 12 00 

Qy 1246 ACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAGACj^ACCCCCATCTGCTC 1305 

II I II II II II M II I II M II I II I II II II II I I II I II II II II II II M I II I 

Db 1201 ACCTCGGGCTTCCAGCCTGTCAGCATCAAGCCCAGCAAAGCAGACAACCCCCACCTGCTC 1260 

Qy 1306 ACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 1365 

I II II II II II II II I I II II I I M I M I I II II I II I II I II I I II I Ml I II 

Db 12 61 ACCATCCAGCCAGACCTCAGCACCACCACTACCACCTACCAGGGCAGTCTATGTTCGAGG 132 0 

Qy 1366 CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGT 1425 

M II M I I M M M II I II II II I II II I I Mill II II II I M II II I Mill 

Db 1321 CAGGATGGACCCAGCCCCAAGTTCCAGCTCTCTAATGGTCACCTGCTCAGCCCACTGGGG 1380 

Qy 1426 GGCGGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCC 1485 

I II II II M II II I II I M I II II II II II I M I II I I II II I II II I I II I 
Db 1381 AGTGGCCGCCATACGTTGCACCACAGCTCACCCACCTCTGAGGCTGAGGACTTCGTCTCC 1440 

Qy I486 CGCCTCTCCACCCAG7VACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTAT 1545 

I I II II II II II II M I II II I II II II II I II I I II II I II II II II II II II 
Db 1441 CGCCTCTCCACCCAAAACTACTTTCGTTCCCTGCCCCGCGGCACCAGCAACATGGCCTAC 1500 

Qy 1546 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTC 1605 

I II II II II I II II M I II I II II I II II I II II II I II I I II I II I I II II II I M I 

Db 1501 GGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACGGGGATCAGCCTCCTC 1560 



Qy 1606 ATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAG 1665 

II Mill I I I I I I I I I I I I I II I I II I I I M I I I I M I I I I I I I I I I I II I I I I 
Db 1561 ATACCCCCGGATGCCATCCCCCGAGGAAAGATCTACGAGATCTACCTCACACTGCACAAG 1620 

Qy 1666 CCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGC 1725 

II I I I I I I I II I I I II M I I I II II I I II I I I II I M II I II II I I I I I I I M I I I I 
Db 1621 CCAGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCAGTCGTTAGC 1680 

Qy 1726 TGTGGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGG 1785 

I I I I I Mill II I I M II II M I I M I I M I M M I I I M I M I II II II I I II 

Db 1681 TGTGGGCCCCCAGGAGTCCTGCTCACCCGGCCAGTCATCCTTGCAATGGACCACTGTGGA 1740 

Qy 178 6 GAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGG 1845 

II I II II II I II M II I II M II II II I I I II II M I I I II II M I M i II II I III 

Db 1741 GAGCCCAGCCCTGACAGCTGGAGTCTGCGCCTCAAAAAGCAGTCCTGCGAGGGCAGTTGG 1800 

Qy 184 6 GAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAG 1905 

II I I II II II II II I II II II II M I II I M M I I II I II M I I I M II II I M I 

Db 18 01 GAGGATGTGCTGCACCTTGGTGAGGAGTCACCTTCCCACCTCTACTACTGCCAGCTGGAG 18 60 

Qy 1906 GCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCC 1965 

II I I I II II M I II II I II I I I I II I I II I M I M I M II II II I I M II I I M I 

Db 1861 GCCGGGGCCTGCTATGTCTTCACGGAGCAGCTGGGCCGCTTTGCCCTGGTAGGAGAGGCC 1920 

Qy 1966 CTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACC 2025 

II I M M M I II II I II II II II I I M III II M II II II I II I M M I M M 

Db 1921 CTCAGCGTGGCTGCCACCAAGCGCCTCAGGCTCCTTCTGTTTGCTCCCGTGGCCTGTACG 1980 

Qy 2 026 TCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAG 2085 

II I I I I II II II II I II II M M I II II I II I I I II M II II II II II I I M I 
Db 1981 TCCCTTGAGTACAACATCCGAGTGTACTGCCTACACGACACCCACGACGCTCTCAAGGAG 2040 

Qy 2086 GTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCAC 2145 

II II I M II I M II I II II I II I II I I II II II I I II II II M II II II II II II I 
Db 2 041 GTGGTGCAGCTGGAGAAGCAGCTAGGTGGACAGCTGATCCAGGAGCCTCGCGTCCTGCAC 2100 

Qy 214 6 TTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGG 22 05 

II II I I II M II II II II II II II M I II I II I II II II II II M II I II I I II I 
Db 2101 TTCAAAGACAGTTACCACAACCTACGTCTCTCCATCCACGACGTGCCCAGCTCCCTGTGG 2160 

Qy 2206 AAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACG 2265 

II I II I M M II M I II I II M I I I M I I II II I II II I I II M I I II I Mill 

Db 2161 AAGAGCAAGCTACTTGTCAGCTACCAGGAGATCCCTTTTTACCACATCTGGAACGGCACC 222 0 

Qy 2266 CAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTG 2325 

I I I I III II II I II II I I II M I II I II M 11 III I 11 II II I 11 I I I II I 

Db 2221 CAGCAGTATCTGCACTGCACCTTCACCCTGGAGCGCATCAACGCCAGCACCAGCGACCTG 2280 

Qy 2326 GCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTC 2385 

II I I II I II M M I M I M M II II II I I I I II II I II II II I II I M I M I M I I 

Db 2281 GCCTGCAAGGTGTGGGTGTGGCAGGTGGAGGGAGATGGGCAGAGCTTCAACATCAACTTC 2340 

Qy 2386 AACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTC 2445 

II II II II II I II II I M II II II I II I II M I II II M M M II I M I II I I I I 

Db 2341 AACATCACTAAGGACACAAGGTTTGCTGAATTGTTGGCTCTGGAGAGTGAAGGGGGGGTC 2400 



Qy 


2446 


Db 


2401 


Qy 


2506 


Db 


2461 


Qy 


2566 


Db 


2521 


Qy 


2626 


Db 


2581 


Qy 


2686 


Db 


2641 



CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATT 2505 

I I I I I I I I I I I I Ml I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I M 

CCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAAAAGATCATC 2460 
TCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTC 2565 

MM M M M M M M M M M M I M M M M I I M M I M M M M M I 
GCCAGTCTGGACCCACCCTGCAGCCGGGGCGCCGACTGGAGAACTCTAGCCCAGA75ACTT 2520 

CACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTC 2625 

M M M I M I M M M I M M M M M M M M M M M M M M M M M I M M M 

CACCTGGACAGCCATCTTAGCTTCTTTGCCTCC7UVGCCCAGCCCTACAGCCATGATCCTC 2580 
AACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTG 2685 

M M I M M M M M M M M M M M M M M M M M M M M M M M I Ml 

AACCTATGGGAGGCACGGCACTTCCCCAACGGCAACCTCGGCCAGCTGGCAGCAGCTGTG 2640 

GCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

M I M M M M M M M I M M M M M M M M M M M M I M M I Ml 

GCCGGACTGGGCCAACCAGATGCTGGCCTCTTCACGGTGTCGGAGGCCGAGTGTTGA 2697 
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TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1787 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-09-933-261-2 

Query Match 56.8%; Score 1562.4; DB 10; Length 1787; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 1661; Conservative 0; Mismatches 16; Indels 9; Gaps 8; 

GCAACTGTACCAGTGACCTCTGTGTACAdAGTGCTTCTGGCCCTGAGGACGTGGCCCTCT 1129 

I I I I I I I I I I I I I I Ill Mlim II IIMIIIMMIIIIMIIIIMIII 

GCAACTGTACCAGTGACCTCTG-GTACAO^CTGjCTTCTGGCCCTGAGGACGTGGCCCTCT 59 

ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 1189 

M I I I I I II I I I I I I I M I I II I I I I I I II I I I I I I II I M I II I I I I I I M I I I I I I II 
ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 119 

TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 1249 

II II II II M II II II I II II II II I I I I I I I II I II I I I I II I I I I I I I I I I I I II I I I 
TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 17 9 

CAGGCTTCCAGCCCGTCAGCATC-AAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 1308 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I II I I I I 
CAGGCTTCCAGCCCGTCAGCATCTAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 239 



Qy 


1070 


Db 


1 


Qy 


1130 


Db 


60 


Qy 


1190 


Db 


120 


Qy 


1250 


Db 


180 


Qy 


1309 


Db 


240 


Qy 


1369 


Db 


300 


Qy 


1429 


Db 


360 


Qy 


1489 


Db 


420 


Qy 


1549 


Db 


480 


Qy 


1609 


Db 


540 



1368 



II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I 



GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 1428 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 1 I I I I I 
GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 359 

GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 14 8 8 



M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I 



1 1 



GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCdCGC 419 

CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 1548 

II I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 479 

ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATC 160 8 

IIIIIIIIIMMIIIIIIIIIMIIIIIIIIIIIIIIIIIIII I I I I I I I I I I I I I II 
ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATC 539 

CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 1668 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I 
CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 599 



Qy 



1669 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 172 8 



Db 


600 


Qy 


1729 


Db 


660 


Qy 


1789 


Db 


720 


Qy 


1849 


Db 


779 


Qy 


1909 


Db 


838 


Qy 


1969 


Db 


898 


Qy 


2029 


Db 


958 


Qy 


2089 


Db 


1018 


Qy 


2149 


Db 


1077 


Qy 


2209 


Db 


1135 


Qy 


2269 


Db 


1195 


Qy 


2329 


Db 


1255 


Qy 


2389 


Db 


1 JlD 


Qy 


2449 


Db 


1375 


Qy 


2509 



I M I I I I 11 I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I N I I I I I I 

GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 659 
GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 178 8 

I I I I I M I I I I I M I I I I I I I I I I I M M M I I I I M I M I I I I I I I I I I I M I I I I I I I 

GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 719 
CCCAGCCCTGACA aCTGGAGCCTGCGCCTCAi^AAAGCAGTCGTGCGAGGGCAGCTGGGAG 184 8 

I I I I I I I I I I I I I M M I I I I I I I I M I I I I I I I I I I I I I M I I M M I I I I I I I I I 

CCCAGCCCTGACT^ GCTGGAGCCTGGCCCTCAAAAAGCAGTCGTGCGAGGG-AGCTGGGAG 778 
GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 1908 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I 

GATGT-CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 8 37 

AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 1968 

I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I 
AGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTC 897 

AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 202 8 

I I I M I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I M I I I I I I I I I I I I 

AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 957 

CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 2088 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I i I I I I i 
CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 1017 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 2148 

I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I M I I M I I I I I I 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTT- 107 6 

AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 2208 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

AAGGACAGTTACCACAACCT— GCCCTATCATCCACGATGTGCCCAGCTCCCTGTGGAAG 1134 

AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 2268 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 1194 

CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 2328 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I 
CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 1254 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 2388 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 1314 

ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 2448 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I 11 I 1 
ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 1374 

GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 2508 

I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I M M M I M M I M I I I I I 

GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 14 34 

AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 2568 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 1 



Db 



1435 AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGT^CTCCAC 1494 



Qy 2569 CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 2628 

I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 1495 CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 1554 

Qy 2629 CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 2688 

I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I 

Db 1555 CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 1614 

Qy 268 9 GGACTGGGCCAGCCAGACGCTGGCCTC-TTCACAGTG-TCGGAGGCTGAGTGCTGAGGCC 2746 

II M II I M I I I I II II I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I 

Db 1615 GGGACTGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCC 1674 

Qy 2747 GGCCAG 2752 

I I I I II 

Db 1675 GGCCAG 1680 



RESULT 9 
US-10-256-702-2 

; Sequence 2, Application US/ 102567 02 
; Publication No. US20030059859A1 
; GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

; TITLE OF INVENTION: Netrin Receptors 

NUMBER OF SEQUENCES: 8 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 
; CITY: SAN FRANCISCO 

STATE: CALIFORNIA 
; COUNTRY: USA 

; ZIP: 94104 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: PatentIn Release #1.0, Version #1.30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 10/2 56, 7 02 

FILING DATE: 27-Sep-2002 
; CLASSIFICATION: <Unknown> 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/933, 2 61 

FILING DATE: 20-Aug-2001 
; APPLICATION NUMBER: 08/808,982 

; FILING DATE: <Unknown> 

ATTORNEY/AGENT INFORMATION: 
; NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 
; REFERENCE/ DOCKET NUMBER: UC96-217 

; TELECOMMUNICATION INFORMATION: 



TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1787 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-256-702-2 

Query Match 56.8%; Score 1562.4; DB 15; Length 1787; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 1661; Conservative 0; Mismatches 16; Indels 9; Gaps 8; 

Qy 1070 GCAACTGTACCAGTGACCTCTGTGTACACAGTGCTTCTGGCCCTGAGGACGTGGCCCTCT 1129 

M I I I M I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 1 GCAACTGTACCAGTGACCTCTG-GTACACACTGCTTCTGGCCCTGAGGACGTGGCCCTCT 59 

Qy 1130 ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I N I I I I I I I I 
Db 60 ATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCG 119 

Qy 1190 TTTATTGCCGGAAGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 1249 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I M M M M I I I I I 

Db 120 TTTATTGCCGG7UVGAAGGAGGGGCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCT 17 9 

Qy 1250 CAGGCTTCCAGCCCGTCAGCATC-AAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 1308 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I 
Db 180 CAGGCTTCCAGCCCGTCAGCATCTAAGCCCAGCAAAGCAGACAACCCCCATCTGCTCACC 239 

Qy 1309 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 1368 

I I I I M I I I I I I M I I I I I I I I I I I I I I I I 1 I I I I I I M I M I I I I I M M I I I I I I I I I 

Db 240 ATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAG 299 

Qy 1369 GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 1428 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I 1 I M 
Db 300 GATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGC 359 

Qy 1429 GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 1488 

I I M I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I M I I M I I I I M M I I I I I I I I I I 

Db 360 GGCCGCCACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGC 419 

Qy 1489 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 1548 

I I I M I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I 1 I I I 

Db 420 CTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGG 479 

Qy 154 9 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATC 1608 

I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I M I I 

Db 480 ACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATC 539 

Qy 1609 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 1668 

I I I I I I I I I I I I M I I I I I I I M I I I I I I I I M I I I I I I 1 I I I I I I I I I I I I I I M I I i I 
Db 540 CCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCG 599 



Qy 



1669 GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 172 8 



Db 


600 


Qy 


1729 


Db 


660 


Qy 


1789 


Db 


720 


Qy 


1849 


Db 


779 


Qy 


1909 


Db 


838 


Qy 


1969 


Db 


898 


Qy 


2029 


Db 


958 


Qy 


2089 


Db 


1018 


Qy 


2149 


Db 


1077 


Qy 


2209 


Db 


1135 


Qy 


2269 


Db 


1195 


Qy 


2329 


Db 


1255 


Qy 


2389 


Db 




Qy 


2449 


Db 


1375 


Qy 


2509 



I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I M M I I I I I I 

GAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGT 659 
GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 1788 

I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I M I M I I I I I I M M I I I I M I I I I I I I 

GGACCCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAG 719 

CCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAG 184 8 

M I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I 11 I I I I I I M I I I I I I I I I I I 
CCCAGCCCTGACAGCTGGAGCCTGGCCCTCAAAAAGCAGTCGTGCGAGGG-AGCTGGGAG 778 

GATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 1908 

Mill M I M II I I M I I I I I I I I I I I I I II I I I I I I I I I II II I I I M I II I I I I I M 

GATGT-CTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCC 837 



II I II I II I I I II I I I I I I I II I I I I II I I I I I I I M I I I I M I I I II I I I I I I I M II 



AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 202 8 

I I I I I I I II II I I I M II I II I I I M I I I I M II M I II I II I I I I I I I I I I I I I I I I I I 

AGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCC 957 

CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 2088 

M I II II I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I II I I I 
CTCGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTG 1017 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTC 2148 

I I II I I II I I I I I I I I I I I I II I I I I I I I I I M I I I I II I I I I II I I I I I I I I M I I I I 

GTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTT- 107 6 

AAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAG 2208 

I I II I I I I I I I I II I I I I M III I 1 M I M I I I I I I I I I I I I I I I I I I I I I I I I 
AAGGACAGTTACCACT^CCT — GCCCTATCATCCACGATGTGCCCAGCTCCCTGTGGAAG 1134 

AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 2268 

II I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I II I I I I II II I I I I I I I I II I I I 

AGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAG 1194 

CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 2328 

I II I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
CGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCC 1254 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 238 8 

II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I II I I 

TGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAAC 1314 

ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 244 8 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCA 1374 

GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 2508 

II I II I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 
GCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCC 1434 

AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 2568 
I II I I I I I I I I I II M I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 


1435 


AGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCAC 


1494 


yy 




CT G GAG AG C C AT C T C AG CT T CT T T GC CT C CAAG C C GAG C C C C AC AGC C AT GAT C CT C AAC 


2628 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1495 


CTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAAC 


1554 


yy 




CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 


2688 




1 1 M 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 




Db 


1555 


CTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT 


1614 


Qy 


2689 


GGACTGGGCCAGCCAGACGCTGGCCTC-TTCACAGTG-TCGGAGGCTGAGTGCTGAGGCC 


2746 




II II II 1 II II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 
II 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 




Db 


1615 


GGGACTGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCC 


1674 


Qy 


2747 


GGCCAG 2752 




Db 


1675 


1 1 1 1 1 1 

GGCCAG 168 0 





RESULT 10 
US-10-296-115-365 

Sequence 365, Application US/10296115 
Publication No. US20040053248A1 
GENERAL INFORMATION: 
APPLICANT: Hyseq Inc 

TITLE OF INVENTION: No. US20040053248Alel Nucleic Acids and Polypeptides 
FILE REFERENCE: 784PCT 

CURRENT APPLICATION NUMBER: US/10/296,115 
CURRENT FILING DATE: 2002-11-18 
PRIOR APPLICATION NUMBER: US09/488,725 
PRIOR FILING DATE: 2000-01-21 
PRIOR APPLICATION NUMBER: US09/552,317 
PRIOR FILING DATE: 2000-04-25 
NUMBER OF SEQ ID NOS : 147 8 
SEQ ID NO 365 
LENGTH: 1321 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-296-115-365 

Query Match 43.8%; Score 1206.6; DB 13; Length 1321; 

Best Local Similarity 98.0%; Pred. No. 1.6e-296; 

Matches 1295; Conservative 0; Mismatches 19; Indels 7; Gaps 7; 

CACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCC 1494 

I I I I 11 I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
CACACACTGCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCC 60 

ACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTC 1554 

I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I M I I I I I I N I I I I I I I I I I I I I I I I I I 
ACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCTTC 120 

AACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCCCA 1614 

M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
AACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATCAGCCTCCTCATCCCCCCA 180 

1615 GATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGAC 1674 



Qy 


1435 


Db 


1 


Qy 


1495 


Db 


61 


Qy 


1555 


Db 


121 


Qy 


1615 



Db 


181 


GATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGAC 


240 


Qy 


1675 


GTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCC 

IIIMIIIIIMIMIIIIIIIIMIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIII 

GTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCC 


1734 


Db 


241 


300 


Qy 


1735 


CCT-GGCGTCCTGCTCACCCGGCCAGTCATCCT-GGCTATGGACCACTGT-GGGGAGCCC 

III 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 
CCTGGGCGTCCTGCTTACCCGGCCAGTCATCCTGGGGTATGGACCACTGTGGGGGAGCCC 


1791 


Db 


301 


360 


Qy 


1792 


AGCCCTGACAGCT-GGAGCCTGCGCCTCAA/s^GCAGTCGTGCGAGGGCAGCTGGGAGGA 

II 1 1 1 II M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 

AGCCCTGACAGCTGGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGA 


1850 


Db 


361 


420 


Qy 


1851 


TGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAG 

II 1 1 1 11 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 
TGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAG 


1910 


Db 


421 


480 


Qy 


1911 


TGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAG 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 
TGCCTGCTACGTCTTCACCGAGCAGCTGAGCCGCTATGCCCTGGTGGGAGAGGCCCTCAG 


1970 


Db 


481 


540 


Qy 


1971 


CGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCT 

MIIIIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCT 


2030 


Db 


541 


600 


Qy 


2031 


CGAGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGT 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 
CGAGTACAACATACTGGTCTACTGCCTGCATGACACTCACGATGCACTCAACGTAGTGGT 


2090 


Db 


601 


660 


Qy 


2091 


GCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIMIIIIIIIIIII III lllllllllll 
GCAGCTGGAGAAGCAGCTGCAGGGACAGCTGATCCAGGAGCCACTGGTACTGCACTTCAA 


2150 


Db 


661 


720 


Qy 

Db 


2151 
721 


GGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAG 

1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 M 

GGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAG 


2210 
780 


Qy 


2211 


TAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCG 

II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 
TAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCG 


2270 


Db 


781 


840 


Qy 


2271 


GTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTG 

1 II 1 1 II 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 M 1 1 1 1 1 1 1 II 1 II 1 1 1 1 M 1 1 1 1 1 

GTACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTG 


2330 


Db 


841 


900 


Qy 


2331 


C7UVGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACAT 

1 1 1 1 1 1 1 II II 1 1 1 1 1 II 1 1 1 1 11 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 
CAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACAT 


2390 


Db 


901 


960 


Qy 

Db 


2391 
961 


CACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGC 

II 1 II 1 II II II 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAGC 


2450 
1020 


Qy 


2451 


CCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAG 
1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 


2510 



Db 



1021 CCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAG 1080 



Qy 


2511 


Db 


1081 


Qy 


2571 


Db 


1141 


Qy 


2631 


Db 


1201 


Qy 


2690 


Db 


1261 


Qy 


2748 


Db 


1321 



CCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCT 257 0 

M 1 I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I M I I 

CCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCT 1140 

GGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCT 2 630 

I M I I I M I M I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I 

GGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACCT 1200 
GTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCT-G 2 68 9 

I M I I I I 1 I I I M I I M I I I I I I I I I I I M I I I I I I I I I I I I I M M I I I I I I I I I I I I 

GTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTGG 1260 
GACTGGGCCAGCCAGACGCTGGCCTC-TTCACAGTG-TCGGAGGCTGAGTGCTGAGGCCG 2747 

I M I I I I I I I I I MM MM M M M M M I M M M M M M M M M M M I 

GACTGGGCCAGCAGGACGGTGGCTTCTTTCACAGTGTTCGGAGGCTGAGTGCTGAGGCCG 132 0 



RESULT 11 

US-09-833-381-1806 

; Sequence 1806, Application US/09833381 

; Patent No. US20020132090A1 

; GENERAL INFORMATION: 

; APPLICANT: Robison, Keith E. 

; TITLE OF INVENTION: No. US20020132090Alel Nucleic Acid and Protein Homoiogs 
; FILE REFERENCE: 5800-119 

; CURRENT APPLICATION NUMBER: US/ 09/833, 381 

; CURRENT FILING DATE: 2 001-04-11 

; PRIOR APPLICATION NUMBER: 09/516,448 

; PRIOR FILING DATE: 2000-02-29 

; NUMBER OF SEQ ID NOS : 2050 

; SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 1806 

LENGTH: 1282 
; TYPE : DNA 

ORGTVNISM: Homo sapiens 

FEATURE: 
; NAME/KEY: misc_f eature 

LOCATION: (1)...(1282) 
; OTHER INFORMATION: n = A,T,C or G 
US-09-833-381-1806 



Query Match 42.0%; Score 1155.8; DB 9; Length 1282; 

Best Local Similarity 98.2%; Pred. No. 1.3e-283; 

Matches 1263; Conservative 0; Mismatches 13; Indels 10; Gaps 



Qy 


1469 


CCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCA 

M M M M M M M M M 1 M M M 1 M 1 M 1 M M 1 M 1 M M M 1 M M 

CCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGCCCCGAGGCA 


1528 


Db 


1 


60 


Qy 


1529 


CCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATA 

M M M M M M M M M M M 1 M M M M M M M M M M M M M M M M M M 1 


1588 



Db 


61 


CCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATA 


120 


Qy 


1589 


CAGGTATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCT 

Mil 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 

CAGGAATCAGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCT 


1648 


Db 


121 


180 


Qy 


1649 


ACCTCACGCTGCACAAGCCGGAAGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

ACCTCACGCTGCACAAGCCGG/yVGACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGC 


1708 


Db 


181 


240 


Qy 

Db 


1709 
241 


TGAGTCCCATCGTTAGCTGTGGACCCCCT-GGCGTCCTGCTCACCCGGCCAGTCATCCT- 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGAGTCCCATCGTTAGCTGTGGACCCCCTGGGCGTCCTGCTCACCCGGCCAGTCATCCTG 


1766 
300 


Qy 


1767 


GGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCT-GGAGCCTGCGCCTCAAAAAGC 

II M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 M 1 1 1 M M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

GGGTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGGAGCCTGCGCCTCAAAAAGC 


1825 


Db 


301 


360 


Qy 


1826 


AGTCGTGCGAGGGCAGCTGGGAGGATGTGC-TGCACCTGGGCGAGGAGGCGCCCTCCCAC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGTCGTGCGAGGGCAGCTGGGAGGATGTGCTTGCACCTGGGCGAGGAGGCGCCCTCCCAC 


1884 


Db 


361 


420 


Qy 


1885 


CTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

CTCTACTACTGCCAGCTGGAGGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGAGCCGC 


1944 


Db 


421 


480 


Qy 


1945 


TTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TATGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTG 


2004 


Db 


481 


540 


Qy 


2005 


TTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTGCATGAC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 M 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

TTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATACTGGTCTACTGCCTGCATGAC 


2064 


Db 


541 


600 


Qy 


2065 


ACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATC 

II 1 1 1 II M 1 M 1 1 II 1 1 1 M M M 1 1 II 1 II II II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 

ACTCACGATGCACTCAACGTAGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATC 


2124 


Db 


601 


660 


Qy 


2125 


CAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCAC 

1 II 1 1 M 1 1 1 II II 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 II 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 

CAGGAGCCACGGGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCAC 


2184 


Db 


661 


720 


Qy 


2185 


GATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTT 

II 1 1 1 1 1 II 1 1 1 II 1 II II 1 1 1 1 1 1 M 1 1 1 II 1 1 1 M 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 M 1 1 1 1 
GATGTGCCCAGCTCCCTGTGGAAGAGTAAGCTCCTTGTCAGCTACCAGGAGATCCCCTTT 


2244 


Db 


721 


780 


Qy 


2245 


TATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTC 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTCACCCTGGAGCGTGTC 


2304 


Db 


781 


840 


Qy 


2305 


AGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 II II 1 1 1 1 I 1 1 1 M i 1 i 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 

AGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGG 


2364 


Db 


841 


900 


Qy 


2365 


CAGAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 

CAGAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCT 


2424 


Db 


901 


960 



Qy 


2425 


Db 


961 


Qy 


2485 


Db 


1021 


Qv 


2544 


Db 


1081 


Qv 


2603 


Db 


1141 


Qy 


2663 


Db 


1201 


Qy 


2723 


Db 


1257 



CTGGAGAGTGAAGCGGGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTC 2484 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M 

CTGGAGAGTGAAGCGNGGGTCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTC 1020 

CTCATTCGGCAGAAGATAA-TTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTG 2543 

I I I I I I I I I I I I I I I I I I I IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 
CTCATTCGGCAGAAGATAATTTTCCAGCCTGGACCCACCCTGTAGGCGGGGTGCCGACTG 1080 

GCGG-ACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGC 2602 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
GCGGAACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTCCAAGC 114 0 

CCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACC 2662 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I M I I I I I I I I I 
CCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACC 1200 

TCAGCCAGCTGGCTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAG 2722 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCAGCCAGCTGG-TGCAGCAGTGGCTGGACTGGGC — AGCAGACGCTGG-CTCTTCACAG 1256 

TGTCGGAGGCTGAGTGCTGAGGCCGG 2748 

I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGTCGGAGGCTGAGTGCTGAGGCCGG 1282 



RESULT 12 
US-10-087-684- 
Sequence 1, 
Publication 



Application US/10087684 
No. US20040029116A1 



GENERAL INFORMATION: 

APPLIC7\NT: Edinger, Shlomit R. 

APPLICANT: MacDougall, John R. 

APPLICANT: Millet, Isabella 

APPLIC7\NT: Ellerman, Karen 

APPLICANT: Stone, David J. 

APPLICANT: Grosse, William M. 

APPLICANT: Lepley, Denise M. 

APPLICANT: Rieger, Daniel K. 

APPLICANT: Burgess, Cathereine E. 

APPLICANT: Gasman, Stacie, J. 

APPLICANT: Spytek, Kimberly A. 

APPLICANT: Boldog, Ferenc L. 

APPLICANT: Li, Li 

APPLICANT: Padigaru, Muralidhara 

APPLICANT: Mishra, Vishnu 

APPLICANT: Shenoy, Suresh G. 

APPLICANT: Rastelli, Luca 

APPLICANT: Tchernev, Velizar T. 

APPLICANT: Vernet, Corine A.M. 

APPLICANT: Zerhusen, Bryan D. 

APPLICANT: Malyankar, Uriel M. 

APPLICANT: Guo, Xiaojia 

APPLICANT: Miller, Charles E. 

APPLICANT: Gangolli, Esha A, 

TITLE OF INVENTION: PROTEINS AND NUCLEIC ACIDS ENCODING SAME 
FILE REFERENCE: 21402-214 CIP 



; CURRENT APPLICATION NUMBER: US/ 10/ 087 , 684 

; CURRENT FILING DATE: 2003-03-10 

; PRIOR APPLICATION NUMBER: 60/253,834 

; PRIOR FILING DATE: 2000-11-29 

; PRIOR APPLICATION NUMBER: 60/250,926 

; PRIOR FILING DATE: 2000-11-30 

; PRIOR APPLICATION NUMBER: 60/264,180 

; PRIOR FILING DATE: 2001-01-25 

; PRIOR APPLICATION NUMBER: 60/274,194 

; PRIOR FILING DATE: 2001-03-08 

; PRIOR APPLICATION NUMBER: 60/313,656 

; PRIOR FILING DATE: 2001-08-20 

PRIOR APPLICATION NUMBER: 60/327,456 
; PRIOR FILING DATE: 2001-10-05 
; NUMBER OF SEQ ID NOS : 220 
; SOFTWARE: CuraSeqList version 0.1 
; SEQ ID NO 1 

LENGTH: 2860 

TYPE: DNA 
; ORGANISM: Homo sapiens 
; FEATURE : 

NAME/ KEY: CDS 
; LOCATION: (59) (2857) 
US-10-087-684-1 

Query Match 34.0%; Score 936.2; DB 13; Length 2860; 

Best Local Similarity 61.7%; Fred. No. 9.1e-228; 

Matches 1662; Conservative 0; Mismatches 938; Indels 93; Gaps 7; 



Qy 143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 2 02 

III III II I I I I I I I I I I I I I I I M I MINI I I II I 

Db 168 ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 227 

Qy 203 TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 262 

I II I I I I I I I I I I M M I I I I Ml III I I I I I I II I I I I I I I I 

Db 228 CCTACATTGTGAAGAACAAGCCTGTGGAGCTCCGCTGCCGCGCCTTCCCCGCCACACAGA 287 



263 TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 322 

I I I M I I I M I I I M I I I I I I I I I I I I I I I I I I I I II I I I II II 
288 TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 347 

323 CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 382 

II I I II II I I I I I I I I I I M I I I M I I I M I I I I 

34 8 TGGATGAGGCCACCGGTCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 4 07 



Qy 383 TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 442 

I III II I II I II M I I I I II II M II I I I I I I I I II I I M I I I I I I I I I III 
Db 408 TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCGG 467 

Qy 443 GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 502 

I I I I II I I I I II II I II I I II I II I I I I Ml M I I II I II M I II I I I 

Db 4 68 GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 527 

Qy 503 AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 562 

I II M I I II II II M II II II I II I I I I II I I Mill I I I I I I I I 
Db 528 AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 587 



Qy 5 63 CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 622 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 58 8 CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 647 

Qy 623 CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 682 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 64 8 CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 707 

Qy 683 TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 742 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 7 08 TGTCGGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCA 767 

Qy 743 CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 802 

I I I II I I I I M II I I I II II I II I I I I I II II I I II I II I I II I I 
Db 768 CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 827 

Qy 803 TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 8 62 

II II I I M II III II I II M II II M II I I I I II M I I I I M I I II I 

Db 828 CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 887 

Qy 863 CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 922 

I II I I I I I M I II II I I I I I I I I I I III I II I I I I II I II I I I Ml 

Db 888 CTCCACTCTiACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 947 

Qy 923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 9 82 

I I I I M I I I I I I I II I I I I I II I I I I I I I I I I I I I I I M I I 

Db 948 CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 1007 

Qy 983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

III I I I I I I M I I I I I II I I I I I I I II I I I II I I I I I I I I I I 
Db 1008 AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1067 

Qy 1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

I I I I I I II I I I I I I I II M I I II I I II III I 

Db 1068 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAACTGG 1127 

Qy 1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGG 1162 

I I I I I I I I I I I I I I I III I I I I 

Db 1128 AGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGG 1187 

Qy 1163 TCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAG 1222 

III II M I I I I M M I I I I I I I I I I I I 

Db 1188 CAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAG 1247 

Qy 1223 ATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 124 8 ACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGG 1307 

Qy 1280 GCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGCACCACCA 1333 

I I I I I I I I I I I I I II I I I I M I I I I I I I I 

Db 1308 CAAGGCCCAGTAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCG 1367 

Qy 1334 CCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA 1370 

II I I I I I I I II I I I I I I I I I I I I 

Db 1368 CCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCATGA 1427 

Qy 1371 TGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 



Db 


1428 


CCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCA 


1487 


Qy 


1400 


—ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACAC ACTGCACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
CCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTG 


1447 


Db 


1488 


1547 


Qy 


1448 


ACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACT 

M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 II 
GCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCC 


1507 


Db 


1548 


1607 


Qy 


1508 


TCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCT 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCT 


1552 


Db 


1608 


1667 


Qy 


1553 


TCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCC 

1 II 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 
TTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCA 


1612 


Db 


1668 


1727 


Qy 


1613 


CAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAG 

1 II III M 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 M M 1 II 1 
ATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAA 


1672 


Db 


1728 


1787 


Qy 


1673 


ACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGAC 

1 M II II 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 II 1 1 M 
GTACCCTGCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGAC 


1732 


Db 


1788 


1847 


Qy 


1733 


CCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCA 

1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 

CCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCA 


1792 


Db 


1848 


1907 


Qy 


1793 


GCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATG 

1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
GTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGG 


1852 


Db 


1908 


1967 


Qy 


1853 


TGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTG 

1 1 II 1 1 1 II M II 1 1 M II 1 M 1 1 M 1 

TGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGG 


1912 


Db 


1968 


2027 


Qy 


1913 


CCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCG 

1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 
CCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCC 


1972 


Db 


2028 


2087 


Qy 


1973 


TGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCG 

1 1 1 1 1 II II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGG 


2032 


Db 


2088 


2147 


Qy 


2033 


AGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGC 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 II 1 1 M 
AGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGG 


2092 


Db 


2148 


2207 


Qy 


2093 


AGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGG 

1 1 II 1 1 II 1 1 1 II 1 III 1 III 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 
AGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGG 


2152 


Db 


2208 


2267 


Qy 


2153 


ACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 
1 1 1 11 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 


2212 



Db 


2268 


ACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCA 


2327 


Qy 


2213 


AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGT 

1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 111 1 
AGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGG 


2272 


Db 


2328 


2387 


Qy 


2273 


ACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCA 

1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 M II 1 1 

CCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCA 


2332 


Db 


2388 


2447 


Qy 


2333 


AGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCA 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 III 1 1 M 1 
AGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGG 


2392 


Db 


2448 


2507 


Qy 


2393 


CCAAG GACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAG 

III III II II 1 II 1 1 1 1 
CAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCA 


2449 


Db 


2508 


2567 


Qy 


2450 


CCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCA 

1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 M 1 1 1 M 1 1 II M II 1 1 1 1 1 1 1 II 
CCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACA 


2509 


Db 


2568 


2627 


Qy 


2510 


GCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACC 

1 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
GCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTA 


2569 


Db 


2628 


2687 


Qy 


2570 


TGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACC 

MINI 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 III 
TGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACC 


2629 


Db 


2688 


2747 


Qy 


2630 


TGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 1 
1 1 1 M 1 M 1 1 1 1 1 1 1 1 i M 1 M 1 Mill II III 1 

TCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGG 


2689 


Db 


2748 


2807 


Qy 


2690 


GACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

II 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 
AGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2860 




Db 


2808 





RESULT 13 
US-10-087-684-3 

Sequence 3, Application US/10087684 
Publication No. US2 0040029116A1 
GENERAL INFORMATION: 
APPLICANT: Edinger, Shlomit R. 
APPLICANT: MacDougall, John R. 
APPLICANT: Millet, Isabelle 
APPLICANT: Ellerman, Karen 
APPLICANT: Stone, David J. 
APPLICANT: Grosse, William M. 
APPLICANT: Lepley, Denise M. 
APPLICANT: Rieger, Daniel K. 
APPLICANT: Burgess, Cathereine E. 
APPLICJWT: Gasman, Stacie, J. 
APPLIC7\NT: Spytek, Kimberly A. 
APPLICANT: Boldog, Ferenc L. 
APPLICANT: Li, Li 



APPLICANT: Padigaru, Muralidhara 
APPLICTM^T: Mishra, Vishnu 
APPLICANT: Shenoy, Suresh G. 
APPLICANT: Rastelli, Luca 
APPLICANT: Tchernev, Velizar T, 
APPLICANT: Vernet^ Corine A.M. 
APPLICANT: Zerhusen^ Bryan D. 
APPLICANT: Malyankar, Uriel M. 
APPLICANT: Guo, Xiaojia 
APPLICANT: Miller, Charles E. 
APPLICANT: Gangolli, Esha A. 

TITLE OF INVENTION: PROTEINS T^D NUCLEIC ACIDS ENCODING SAME 
FILE REFERENCE: 21402-214 CIP 
CURRENT APPLICATION NUMBER: US/10/087,684 
CURRENT FILING DATE: 2003-03-10 
PRIOR APPLICATION NUMBER: 60/253,834 
PRIOR FILING DATE: 2000-11-29 
PRIOR APPLICATION NUMBER: 60/250,926 
PRIOR FILING DATE: 2000-11-30 
PRIOR APPLICATION NUMBER: 60/264,180 
PRIOR FILING DATE: 2001-01-25 
PRIOR APPLICATION NUMBER: 60/274,194 
PRIOR FILING DATE: 2001-03-08 
PRIOR APPLICATION NUMBER: 60/313,656 
PRIOR FILING DATE: 2001-08-20 
PRIOR APPLICATION NUMBER: 60/327,456 
PRIOR FILING DATE: 2001-10-05 
NUMBER OF SEQ ID NOS : 220 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 3 
LENGTH: 2860 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (59) . . (2857) 
US-10-087-684-3 

Query Match 34.0%; Score 936.2; DB 13; Length 2860; 

Best Local Similarity 61.7%; Pred. No. 9.1e-228; 

Matches 1662; Conservative 0; Mismatches 938; Indels 93; Gaps 7; 

ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 202 

III III II I II I I I I I I I I I M M M I I I I II I II I I 

ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 227 

TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 2 62 

I I I I I II I I I I I I I I I I I I I I II I I III I M I I I I II I I I M I I 

CCTACATTGTGAAGAACAAGCCTGTGGAGCTTCGCTGCCGCGCCTTCCCCGCCACACAGA 287 

TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 322 
III I II I I I II I I I I II I I I I I I I I I I I II I II I I I I I I I II II 



I I I I M I I II I I I I I I I I II I I I I I I I I II I I I I 



Qy 


143 


Db 


168 


Qy 


203 


Db 


228 


Qy 


263 


Db 


288 


Qy 


323 


Db 


348 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



383 



408 



443 



468 



503 



528 



563 



588 



623 



648 



683 



708 



743 



768 



803 



828 



863 



923 



948 



TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 

I Ml II I II M I I I I I I I II I I I I I I I I M M I I I I I I M I I I I I II I I I I 

TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCAG 

GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 

M II I M II I II I II I II M I I I I I I I I III I II I I I I I I I I I II I I I 

GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 

AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 

I I I I I I I I II I I I I I I II I I I I I I I II I I I I I I I I M I I I I I I I I 
AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 

CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 

I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 

CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I 

CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 

TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I II I I I I II I III 

TGTCGGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCA 

CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I 
CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 

TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I II I I I I I I I I I 
CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 

CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGA7\AACAGCCTGCGCCA 

I II I I I I I I I I II II II I I I I I II I III I I I I I I I II I I I I I I III 

CTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 



442 



467 



502 



527 



562 



587 



622 



647 



682 



707 



742 



767 



802 



827 



862 



887 



922 



947 



982 



CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 

I I I I I M I I I I I I I I I I II I I I I I II I II I I I M I II M I I 

CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 1007 



983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

III I I I I I I II I I I M I I I I I I I I I II I I II I I II I I I I I I I 
1008 AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1067 

1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

I I I I I I I I I I I I I I I I II I I I II I I I I III I 

1068 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAACTGG 1127 

1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGG 1162 

I I I I I I I I I I I I M I IN I I I I 

112 8 AGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGG 1187 

1163 TCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAG 1222 

IN II II I I I I M I M I I I I I I I I I I I 

1188 CAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAG 1247 



122 3 ATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATCi\AGCCCA 1279 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

1248 ACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGG 1307 

1280 GCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGCACCACCA 1333 

I I I I I I I I I I I I I II II I I I I I I I I I I I I 

1308 CAAGGCCCAGTAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCG 1367 

1334 CCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA 137 0 

II II III M II I I I I I I I I I I I I 

1368 CCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCATGA 1427 

1371 TGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 

I I I I I I I I I I I I I I I I II 
1428 CCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCA 1487 

1400 — ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACAC ACTGCACC 1447 

I I II I I I I II I I I I M I I I I III 

1488 CCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTG 1547 

1448 ACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACT 1507 

I I I I I I I I II I I I I I I I II I I I II 

1548 GCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCC 1607 

1508 TCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCT 1552 

Mill I I I I I I II I I I I I I I II I 

1608 TCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCT 1667 

1553 TCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCC 1612 

I II I I I I I I I I I I I I Mill I I M I II I II I M MM 

1668 TTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCA 1727 
1613 CAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAG 1672 

I I M I I M II II II I I II I II M I M II I II I M I II I 

1728 ATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAA 17 87 

1673 ACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGAC 1732 

I II II II Mill I I I I I M I II I II II I II I 

1788 GTACCCTGCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGAC 1847 

1733 CCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCA 1792 

II I I I I I I II M I M I II I II II M I I I I I II II I II I II II 
1848 CCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCA 1907 

1793 GCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATG 1852 

I I M II I I I I I I II I I MM I II I II I M II II I II 
1908 GTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGG 1967 

1853 TGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTG 1912 

II II Mill II II II M I II II M I II II II II I II II I II I 
1968 TGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGG 2027 

1913 CCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCG 1972 

II I I I II II I II II M II II II I I M II M M I 
2028 CCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCC 2087 

1973 TGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCG 2 032 



I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 2088 GCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGG 2147 

Qy 2033 AGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGC 2 092 

I M I I I I M I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 214 8 AGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGG 22 07 

Qy 2093 AGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGG 2152 

I I I I I I M I I I I I I 1 I I I III I I M II I II I I I I I I I 

Db 2208 AGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGG 2267 

Qy 2153 ACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 2212 

I I II I I I I I I I I M I I I I I I I I I II I I II I I I II I III I I I I I II I 

Db 2268 ACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCA 2327 

Qy 2213 AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGT 2272 

M I I I I I II I I I I I I I I II I I I I I I I I I M I I II I I I I I I I I I III I 
Db 2328 AGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGG 2387 

Qy 2273 ACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCA 2332 

I I M I I I I I I I I M I I I I I I I I I I I I I III I I I I I I I I I I 

Db 238 8 CCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCA 2447 

Qy 2333 AGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCA 2392 

II I I I M I I I I I M I I I I I I I I I I I I I III I I II I 

Db 2448 AGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGG 2507 

Qy 2393 CCAAG GACACTVAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAG 2449 

III III II II I II I I I I 

Db 2508 CAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCA 2567 

Qy 2450 CCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCA 2509 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II 
Db 2568 CCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACA 2627 

Qy 2510 GCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACC 2569 

I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 2628 GCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTA 2687 

Qy 2570 TGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACC 2629 

II I I I I I I I I I I I I II I I I I I I I I I I I I M I I II I I I M III 

Db 2688 TGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACC 2747 

Qy 2630 TGTGGGAGGCGCGGCACTTCCCCAACGGC7\ACCTCAGCCAGCTGGCTGCAGCAGTGGCTG 2689 

I II I II I I I I I I I I I I I II I I I I I I I I I M III I 

Db 274 8 TCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGG 2 8 07 

Qy 2690 GACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

I I I I I I I II I I I I I I I I I I II I I 

Db 2808 AGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2860 



RESULT 14 

US-10-218-779-1 

; Sequence 1, Application US/10218779 
; Publication No. US20040029222A1 
; GENERAL INFORMATION: 



APPLICANT: Edinger, Shlomit 
APPLICANT: MacDougall, John 
APPLICANT: Millet, Isabelle 
APPLICANT: Ellerman, Karen 
APPLICANT: Stone, David 
APPLICANT: Gerlach, Valerie 
APPLICANT: Grosse, William 
APPLICANT: Alsobrook II, John 
APPLICANT: Lepley, Denise 
APPLICANT: Rieger, Daniel 
APPLICANT: Burgess, Catherine 
APPLICANT: Gasman, Stacie 
APPLICANT: Spytek, Kimberly 
APPLICANT: Boldog, Ferenc 
APPLICANT: Li, Li 

APPLICANT: Padigaru, Muralidhara 
APPLICANT: Mishra, Vishnu 
APPLICANT: Patturajan, Meera 
APPLICANT: Shenoy, Suresh 
APPLICANT: Rastelli, Luca 
APPLICANT: Tchernev, Velizar 
APPLICANT: Vernet, Corine 
APPLICANT: Zerhusen, Bryan 
APPLICANT: Malyankar, Uriel 
APPLICANT: Guo, Xiaojia 
APPLICANT: Miller, Charles 
APPLICANT: Gangolli, Esha 

TITLE OF INVENTION: Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-214 

CURRENT APPLICATION NUMBER: US/10/218,779 
CURRENT FILING DATE: 2002-08-14 
PRIOR APPLICATION NUMBER: 60/253,834 
PRIOR FILING DATE: 2000-11-29 
PRIOR APPLICATION NUMBER: 60/250,-926 
PRIOR FILING DATE: 2000-11-30 
PRIOR APPLICATION NUMBER: 60/264,180 
PRIOR FILING DATE: 2001-01-25 
PRIOR APPLICATION NUMBER: 60/313,656 
PRIOR FILING DATE: 2001-08-20 
PRIOR APPLICATION NUMBER: 60/327,456 
PRIOR FILING DATE: 2001-10-05 
NUMBER OF SEQ ID NOS : 216 
SOFTWARE: Patentin Ver. 2.1 
SEQ ID NO 1 
LENGTH: 2860 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-218-779-1 



Query Match 34.0%; Score 936.2; DB 13; Length 2860; 

Best Local Similarity 61.7%; Pred. No. 9.1e-228; 

Matches 1662; Conservative 0; Mismatches 938; Indels 93; Gaps 



7; 



Qy 



Db 



143 ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 2 02 

III IN M I II I I II I I I I I I II M I I I II II II I I I 

168 ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 227 



Qy 


203 


TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III III III 1 1 II 1 1 1 1 1 1 1 II 
CCTACATTGTGAAGAACAAGCCTGTGGAGCTCCGCTGCCGCGCCTTCCCCGCCACACAGA 


262 


Db 


228 


287 


Qy 


263 


TCTTCTTCAAGTGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 

1 II 1 II 1 II II II 1 1 1 M II II II II 1 1 II 1 1 1 1 1 II 1 1 1 II M 
TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 


322 


Db 


288 


347 


Qy 


323 


CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCi^AGGCAGCAGG 

1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 
TGGATGAGGCCACCGGTCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 


382 


Db 


348 


407 


Qy 


383 


TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 

1 III II 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II 1 11 1 II II II M 1 1 1 1 1 1 1 III 
TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCGG 


442 


Db 


408 


467 


Qy 


443 


GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 

1 M 1 II II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 


502 


Db 


468 


527 


Qy 


503 


AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 


562 


Db 


528 


587 


Qy 
Db 


563 
588 


CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 

1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 
CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 


622 
647 


Qy 


623 


CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 
CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 


682 


Db 


648 


707 


Qy 

Db 


683 
708 


TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 

1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
TGTCGGACACTGCCAACTATACCTGCGTGGCC7\AGAACATCGTGGCCAAACGCCGGAGCA 


742 
767 


Qy 


743 


CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 


802 


Db 


768 


827 


Qy 


803 


TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 II 1 1 1 II 1 1 1 1 1 1 M 1 1 1 II II 1 1 1 1 
CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 


862 


Db 


828 


887 


Qy 


863 


CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 

1 II 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 III M 1 1 1 1 1 II 1 1 1 1 1 1 III 
CTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 


922 


Db 


888 


947 


Qy 


923 


CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 

II 1 1 1 1 1 1 1 1 1 II II 1 1 II 1 II II 1 II II II 1 II 1 1 1 1 1 1 1 
M 1 II II II 1 1 M II 1 1 II II 1 II 1 1 1 1 1 1 1 1 II Mill 1 1 

CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 


982 


Db 


948 


1007 


Qy 


983 


ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 

III 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 


1042 


Db 


1008 


1067 



Qy 1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 



I 1 1 1 1 1 1 1 Mini M I II II II II 1 1 III I 

Db 1068 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAACTGG 1127 

Qy 1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGG 1162 

I I I I I I II II I I I I I III I I I I 

Db 1128 AGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGG 1187 

Qy 1163 TCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGGGCTGGACTCAG 1222 

III II M I I II I I I I I I M I I I I I I I I 

Db 1188 CAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAG 1247 

Qy 1223 ATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

I I I II I I II I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 124 8 ACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGG 1307 

Qy 1280 GCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGCACCACCA 1333 

I I I I I I I I I I I I I II II I I I I I I I I I I I I 

Db 1308 CAAGGCCCAGTAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCG 1367 

Qy 1334 CCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA 1370 

II I I I I I I I II I I I M I II I I I I 

Db 1368 CCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCATGA 1427 

Qy 1371 TGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 

I I I I I I I I I I I I M I I II 
Db 1428 CCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCA 1487 

Qy 14 00 — ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACAC ACTGCACC 1447 

I I I I I I I M I I I I I I M I I I Ml 

Db 148 8 CCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTG 1547 

Qy 1448 ACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACT 1507 

I I I I I I I I I I I I II I M I I I I I II 

Db 1548 GCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCC 1607 

Qy 1508 TCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCT 1552 

I I I I I I I I I I I I I I I I I I MM I I I I II I 

Db 1608 TCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCT 1667 

Qy 1553 TCAACTTCCTCGGGGGCCGGCTGATGATCCCT7\ATACAGGTATCAGCCTCCTCATCCCCC 1612 

I I II I II II I Mill I II I I I II I I M I I II II II I I 

Db 1668 TTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCA 1727 

Qy 1613 CAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAG 1672 

I II I I I I II I II I I I I I I II I II II I I I I I I II I II II 

Db 1728 ATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAA 1787 

Qy 1673 ACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGAC 1732 

I II II II I I III I II I I M I I II I I I II II I 

Db 17 88 GTACCCTGCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGAC 1847 

Qy 1733 CCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCA 1792 

II II M M II I II I I I II II I I I I II II I I II I I II II M II 

Db 184 8 CCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCA 1907 

Qy 1793 GCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATG 1852 

I I II I I II I I II II II I I II II I I I I M I I II I I II 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



19 08 GTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGG 1967 
1853 TGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTG 1912 

MM M I M M M M I I I I M M M M M M I M M I M I I I 

1968 TGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGG 2027 

1913 CCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCG 1972 

MM M M I M M M M M I M I I M I M I M I 

2028 CCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCC 2 0 8 7 
1973 TGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCG 2032 

I M I M M M I M M M I I M M M I M M M I M M I I 
2088 GCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGG 2147 

2033 AGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGC 2092 

M M M I M M M M M M M M I I M M I I I M M I M M M M I M 

214 8 AGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGG 22 07 
2093 AGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGG 2152 

M M M M I M M I M I I Ml M M M I M M M M I 

2208 AGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGG 2267 
2153 ACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 2212 

M M M M M M M M M I M I M I M M M MM Ml II II M I I 

2268 ACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCA 2327 
2213 AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGT 2272 

II II II I M M I II M II II I M I M I I II I I II II II II I I I Ml I 

2328 AGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGG 2387 
2273 ACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCA 2332 

I I M I M M I I I I II II II I II I MM III II I I II II I I 

2388 CCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCA 24 4 7 

2333 AGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCA 2392 

II I II II I II II II I M I I II I I M M III I I II I 

2448 AGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGG 2507 

2393 CCAAG GACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGGTCCCAG 2449 

III III II II I II II I I 

2508 CAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCA 2 567 

2450 CCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCA 2509 

II I I II II I I II II I II II I I II II I I M II I II I I I I II II 
2568 CCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACA 2627 

2510 GCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACC 2569 

II II M I II I Mill I I I II II M I I II II I I I II I 

2628 GCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTA 2 687 

2570 TGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACC 2629 

I II II I I II I I I I II II I II M I I I I II II I I I II II II Ml 
2688 TGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACC 2747 

2630 TGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTG 2689 

I I I M I M I I I I I I I I II I II I I I II I I II III I 

2748 TCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGG 2807 



Qy 2690 GACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

M I I I I I II I I I I I I I I M I I I I 

Db 2 808 AGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2860 
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Query Match 34.0%; Score 936.2; DB 13; Length 2860; 

Best Local Similarity 61.7%; Pred. No. 9.1e-228; 

Matches 1662; Conservative 0; Mismatches 938; Indels 93; Gaps 7; 



Qy 


143 


ACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCGAGGATG 

III III II 1 1 1 1 1 M 1 M 1 1 1 II II 1 1 1 1 1 1 1 1 1 II 1 
ACTCCTTCCCGTCAGCGCCAGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACG 


202 


Db 


168 


227 


Qy 


203 


TGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGA 

II M 1 II II II II II III II 1 Ml! Ill 1 1 1 1 M 1 1 1 1 1 1 1 1 M 

CCTACATTGTGAAGAACAAGCCTGTGGAGCTTCGCTGCCGCGCCTTCCCCGCCACACAGA 


262 


Db 


228 


287 


Qy 


263 


TCTTCTTCAAGTGCT^ACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 II II 
TCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCACACAGGAAGGCC 


322 


Db 


288 


347 


Qy 


323 


CAGACGGGAGCAGTGGGCTGCCCACCATGGAGGTCCGCATTAATGTCTCAAGGCAGCAGG 

1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 II Mill 1 1 1 1 1 1 1 1 1 

TGGATGAGGCCACCGGCCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGG 


382 


Db 


348 


407 


Qy 


383 


TCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGGCATGGAGCTCCTCGG 

1 III II 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGGAGGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCAG 


442 


Db 


408 


467 


Qy 
Db 


443 
468 


GCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCAGATTGCGCAAGAACTTCGAGC 

1 1 1 1 1 1 1 M 1 M 1 1 1 II 1 1 1 1 1 II 1 1 1 1 III 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

GCACCACCAAGAGTCGCCGAGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATC 


502 

527 


Qy 


503 


AGGAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCAC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCATGAGGTTCTCCTGCAGTGCCGCCCGC 


562 


Db 


528 


587 


Qy 


563 


CGGAGGGCATCCCTCCAGCCGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGT 

M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

CGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGTCATCGACCCCA 


622 


Db 


588 


647 


Qy 


623 


CCCTGGACCCCAATGTATACATCACGCGGGAGCACAGCCTGGTGGTGCGACAGGCCCGCC 

M 1 1 1 1 1 M 1 1 1 MM M 1 1 1 1 II 1 1 1 1 1 II 11 II M 1 1 

CCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCC 


682 


Db 


648 


707 


Db 


683 
708 


TTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGCCGCAGCG 

1 1 Mill M 1 II II 1 1 1 M II M 1 1 1 II 1 II II II M 1 II 1 II 1 II III 

TGTCGGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCA 


742 
767 


Qy 


743 


CCTCCGCTGCTGTCATCGTCTACGTGAACGGTGGGTGGTCGACGTGGACCGAGTGGTCCG 

1 1 1 II 1 II 1 II M M II II 1 II 1 II II 1 M M 1 1 1 1 1 II 1 II 1 1 1 
CCACTGCCACCGTCATCGTCTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCAC 


802 


Db 


768 


827 


Qy 


803 


TCTGCAGCGCCAGCTGTGGGCGCGGCTGGCAGAAACGGAGCCGGAGCTGCACCAACCCGG 

II II 1 1 1 II 1 II 1 II II 1 M M II II M 1 1 1 II 1 II II 1 1 1 II 1 1 1 1 
CCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAAGCGCACCCGGACCTGCACCAACCCCG 


862 


Db 


828 


887 


Qy 


863 


CGCCTCTCAACGGGGGCGCTTTCTGTGAGGGGCAGAATGTCCAGAAAACAGCCTGCGCCA 


922 



I II 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 11 



888 CTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGACCGCCTGCACCA 947 

923 CCCTGTGCCCAGTAGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCTGG 982 

I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I 

948 CCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTG 1007 

983 ACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGG 1042 

III II M I I I II I M II I I I I I I I I II I I I I I I I II I I II I I 

1008 AGTGTGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTG 1067 

1043 AGTGCCAGGGCACTGACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACAGTG 1102 

MM II II I I I I I I I I I I I I I II I I I I III I 

1068 ACTGCAGCGGGACGCTGCTCGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAACTGG 1127 

1103 CTTCTGGCCCTGAGGACGTGGCCCTCTATGTGGGCCTCATCGCCGTGGCCGTCTGCCTGG 1162 

I I I I I I I I I I I I I I I IN MM 

1128 AGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGG 1187 

1163 TCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGG7\AGAAGGAGGGGCTGGACTCAG 1222 

II I II II I I II I I I M II I I I M M II 
1188 CAATCCTCATGGCGGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAG 1247 

1223 ATGTGGCTGACTCGTCCATT CTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCA 1279 

I I II II M I I II II II II I II 

124 8 ACATCACTGACTCATCTGCTGCCCTGACTGGTGGTTTCCACCCCGTCAACTTTAAGACGG 1307 

1280 GCAAAGCAGACAACCCCCATCTGCT CACCATCCAGCCGGACCTCAGCACCACCA 1333 

I I I II I II II II I II II II I II II I Ml! 

1308 CAAGGCCCAGTAACCCGCAGCTCCTACACCCCTCTGTGCCTCCTGACCTGACAGCCAGCG 1367 

1334 CCACCACCTACCAGGGCAGTCTCTGTCCCCGGCAGGA 1370 

II II I M I I II I II M II I I M I 

1368 CCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCATGA 1427 

1371 TGGGCCCAGCCCCAAGTTCCAGCTCACCA 1399 

II II I II II II I M M M 
1428 CCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCA 1487 

1400 — ATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACAC ACTGCACC 1447 

II II M I M I I I I I I II M I IN 

14 88 CCACGGGCTCTGGGCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTG 1547 

1448 ACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACT 1507 

II I I I I I I II I I II I II I I I I I II 

1548 GCACATACCCTAGCGATTTCGCCCGGGACACCCACTTCCTGCACCTGCGCAGCGCCAGCC 1607 

1508 TCCGCTC CCTGCCCCGAGGCACCAGCAACATGACCTATGGGACCT 1552 

Mill II II II II I I II I MM I Mini 

1608 TCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAGCAGCGTCAGCGGCACCT 1667 
1553 TCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGTATCAGCCTCCTCATCCCCC 1612 

I M II II I II Mill Mill I II M II I M II I I I II 
1668 TTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCCCA 1727 

1613 CAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAG 1672 

I II I II II M I II M II II II M I I I M I II II I I I II 

1728 ATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAA 17 87 



Qy 


1673 


ACGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGAC 

1 II II II Mill 1 II II II 1 M 1 1 1 1 1 1 i 1 1 

GTACCCTGCCGCTTTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGAC 


1732 


Db 


1788 


1847 


Qy 


1733 


CCCCTGGCGTCCTGCTCACCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCA 

II 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 M 1 II 1 1 1 1 1 M 1 1 11 1 M II 
CCACAGGCCTCCTGCTGTGCCGCCCCGTCATCCTCACCATGCCCCACTGTGCCGAAGTCA 


1792 


Db 


1848 


1907 


Qy 


1793 


GCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAGTCGTGCGAGGGCAGCTGGGAGGATG 

1 1 1 1 1 II 1 1 1 1 1 II 1 1 INI 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 
GTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCAGGGCCACTGGGAGGAGG 


1852 


Db 


1908 


1967 


Qy 


1853 


TGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGAGGCCAGTG 

MM Mill 1 II 1 1 II 1 1 II 1 M II 1 1 1 1 II 1 M 1 1 1 

TGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAGGG 


1912 


Db 


1968 


2027 


Qy 

J: 


1913 


CCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCG 

II 1 1 II II 1 1 1 1 M II M 1 1 II 1 1 1 II II 1 II 1 

CCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCC 


1972 


Db 


2028 


2087 


Qy 


1973 


TGGCTGCCGCCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCG 

1 M 1 II II II II 1 M 1 M 1 II 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 

GCTCAGCAGTCAAGCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGG 


2032 


Db 


2088 


2147 


Qy 


2033 


AGTACAACATCCGGGTCTACTGCCTGCATGACACCCACGATGCACTCAAGGAGGTGGTGC 

1 M II 1 1 1 1 II M II 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 II 1 1 1 II 

AGTACAGCCTCCGGGTCTACTGCCTGGAGGACACGCCTGTAGCACTGAAGGAGGTGCTGG 


2092 


Db 


2148 


2207 


Qv 


2093 


AGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACGGGTCCTGCACTTCAAGG 

1 II II II 1 1 M II 1 t II 1 III 1 II 1 II 1 II II 1 1 1 1 1 
AGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAAACCGCTAATGTTCAAGG 


2152 


Db 


2208 


2267 


Qv 


2153 


ACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 

1 1 1 1 1 1 M M 1 II 1 M II M II II 1 II 1 1 1 1 1 1 1 1 Ml II 1 1 1 1 1 1 

ACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAGCA 


2212 


Db 


2268 


2327 


Qy 


2213 


AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGT 

1 M 1 II 1 II II M M M II 1 II 1 1 1 1 II M M 1 1 1 1 M II 11 1 III 1 

AGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGG 


2272 


Db 


2328 


2387 


Qv 


2273 


ACTTGCACTGCACCTTCACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCA 

1 1 II 1 II II 1 II II II 1 II M 1 1 II 1 1 III II 1 1 II 1 1 II 

CCCTCCACTGCACTTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCA 


2332 


Db 


2388 


2447 


Qy 


2333 


AGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCAGAGCTTCAGCATCAACTTCAACATCA 

M 1 II II 1 1 M 1 1 M 1 1 II 1 1 II M 1 1 III 1 1 II 1 
AGATCTGCGTGCGGCAAGTGGAAGGGGAGGGCCAGATATTCCAGCTGCATACCACTCTGG 


2392 


Db 


2448 


2507 


Qy 


o o n o 

2333 




2449 


Db 


2508 


III III M II 1 II INI 

CAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCCTGGCAGCACTGTCACCA 


2567 


Qy 


2450 


CCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCA 

II 1 1 II M II II II II II 1 II 1 M 1 1 1 1 1 M 1 II 1 M II 1 II 
CCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAACA 


2509 


Db 


2568 


2627 



Qv 2 510 GCCTGGACCCACCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACC 2569 

I I I I I I I I I I Mill I II II I I I II I 

Db 2628 GCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTA 2687 

Qy 2570 TGGACAGCCATCTCAGCTTCTTTGCCTCCAAGCCCAGCCCCACAGCCATGATCCTCAACC 2629 

lllll I I II I I IMIIIi Mil 1 IMMMI I IIMIII III 

Db 2688 TGGACCGGTACCTGAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACC 2747 

Qy 2630 TGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGGCTGCAGCAGTGGCTG 2689 

I I I I I I M I I II MM M M I I I Mill II III I 

Db 2748 TCTGGGAAGCTCTGCAGCAGGACGATGGGGACCTCAACAGCCTGGCGAGTGCCTTGGAGG 2807 

Qy 2690 GACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGA 2742 

Mill II II I I I II I II IMMI 

Db 2808 AGATGGGCAAGAGTGA6ATGCTGGTGGCTGTGGCCACCGACGGGGACTGCTGA 2860 
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